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This pcoject developed and tested a model of human question answe».ing (called QUEST). 
QUEST accounts foe the answers that adults produce when they answer different categoriei 
of open-class questions, such as why, how, when, and what-if . QUEST identifies the 
information sources for questions and assumes that knowledge is organized in the form 
of conceptual graph structures containing statement nodes and relational arcs. 
Example types of structures inci Je goal hierarchies, causal networks, taxononic 
hierarchies, and spatial partonomies. Question answering procedures operate 
systematically on these knowledge structures- An irt^ortant property of QUEST consists 
of three convergence mechanisms which narrow down the node space from do7,ens/hundreds 
of nodes to a handful of nodes that serve as good answers to a question. First, an 
arc search procedure restricts its search to particular paths of relational arcs, 
depending on the question category; nodes on legal paths are better answers than nodes 
on illegal paths. Secc^d, answer quality (^creases as a function of structural 
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distance (i.e./ number of aces) between the queried ncsde and the answer node. Third, 
a constraint satisfaction component prunes out potential answers that are conceptual- 
ly incompatible with the queried node. QUEST al3o contains a pragmatic component 
which conaidecs the goals and common ground of the speech participants. 

C?UEST was tested in the context of expository text on scientific mechanisms, 
narrati-^e textf and generic concepts. The nvodel successfully predicts the likelihood 
of generating particular answers to questions and "goodness~ot-answer" judgements 
(and latencies) for particular question-answer pairs. QUEST can also account for 
answers produced in pragmatically complex contexts^ such as telephone surveys^ 
televised interviews^ and business transactions. 



00 form 1471. JUN 86 ' . . 



APPROVED FOR PUBLIC RELEASE: 
DISTRIBUTION UNLIMITED 



Question Answering 1 



ABSTRACT 

This pro^ developed and tested a model of human cn^estion ansvirerlng (caHed QUEST). 
QUEST accounts fof the answers that adults produce when they answer different categories of 
open-cl^ questions, such as why. how. when, and wt^-if . QUEST Identifies the hifoimatton 
sources for questions; the primaiy inform^lon sources are associaJed with the content wowts in 
questioi^ (i.e., nouns, main vefts, adjectives). Each information source is organized in the fomi 
of a conceptual graph stmcture that contains nodes and relational arcs. Example types of 
stfudures are goal hierarchies, causal networks, taxonomic hierarchies, and spatial partonomias. 
Question answering procedures operate systematically on these conceptual graph stnjdures. 

An impoftar^ property of QUEST consists of three convergence mechanisms that narrow down 
the node spa» from dozens/hundreds of nodes to a handful of nodes which serve as good 
answers to a questton. Rrst, an arc search procedure restricts its search to particular paths of 
relational arcs, deperKling on the question category; nodes on legal paths are better answers 
than nodes on illegal paths. Second, answer quali^ decreases as a function of stmctural 
distance. I.e.. the number of arcs between the queried rwde and answer mtie. Third, a constraint 
satisfaction component prunes out potential answers that are conceptually incompatible with the 
queried node. QUEST also contains a pragmatic component which considers the goals and 
common grtJund of speech participants. 

QUEST was tested in the context of expository texts on scientific mechanisms, narrative texts, 
and generic concepts. Tne model successfully predicted (a) the li elihood of generating 
particular answers to questions and (b) "goodness-ot-answer judgments for particular quesJion- 
aa^er pairs. QUEST can also a<xount for answens produced in conversational contexts that 
have more complex pragmatic conslrainls. such as telephone sun^eys. televised interviews, and 
business transactions. 
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INVESTIGATIONS OF HUMAN QUESTION ANSWERING 

The puipose of this project was to investigate how aduHs answer open-class questions, such as 
Why. how. when and what-if questions. We examined the Knowlec^e structures that furnish 
answers to questions and the cognitive procedures that <»nverge on appropriate and relevant 
answe-^ to part«ulaf questions. This final repon on the ONR project has three major parts. First 
we provide a bnef introductfon on question answering research, setting the stage for the research 
to be reported- Second, we descritw QUEST, a model of human question answering that we 
have developed. Third, we report a series of experiments that test the QUEST mode! in the 
context of expository text, nanative text, generc concepts, and naturalistic conversation. 

Introduction 

Que^fon answering is a very important-activity during knowledge acquisition, communication and 
social interaction. In spite of this, question asking and question answering have rarely been direct 
objects of inquiry in the cognitive sciences. Linguists have analyzed the syntactic form of 
questions, but rarely have attempted to explain the content of answers and the world knowledge 
that supplies the content. Researchers in education have analyzed the extent to which lext 
acquisition is influenced by adjunct questions, i.e., questions that are placed at the beginning in 
the middle, versus at the end of a text, but they have not considered the process of question ' 
answering. 

Cognitive psychologists have investigated the process of answering closed-class questions at 
OTnsKJerabte depth (Clark & Clark. 1977; Glucksberg & McCtoskey. 1981; Reder. 1982, 1987; 
Singer. 1 9B6, in press). Appropriate responses to closed-class questions are restricted to a 
limited rwmber of alternatives anj nonnaHy are short answers. For example, answers to 
venlication questions are yes. qq, maybe, and I dont know . Closed-class questions are 
contrasted with open-class questions which invite replies with elaborate verbal descriptions e g 
why. how. when, and what-i! questions. Progress in understanding open-class questinns has 
been conparatively slow in psychology (Collins. Warnock. Aiello, & Miller. 1975; Graesser & Black 
1985: Graessera Golding. igsS; Norman. 1973; Norman & Rumelhart, 1975; Piaget 1952 
Shanon. 1983; Trabasso, van den Broek, & Lui, 1988). 

The fiekjs of artificial intelligence and computational linguistics have furnished detailed models of 
question answering that account for the content of answers and the world knowledge that 
supplies ir.is content (Allen, 1983: Bmce. 1982; Dahfgren. 1988; Dyer. 1983; Kaplan. 1983- 
Lehnert. 1978; Lehnert. Oyer. Johnson. Young, & Harley, 1983; McKeown, 1985; Souther 
Acker, Lester, & Porter. 1969; Woods, 1977). in most of these models, text and world knowledge 
are organized in the form of a stnjctured database. Question answering (Q'A) procedures access 
these information sources and search through the structures systematically. The formalisms and 
insights from these fields obviously nujst be tested in psychological experiments before we can 
incorporate them into psychological models of human question answering. One objective of this 
ONR contract was to test some of these fonr.alisms and insights. 

The devetopment of the QUEST model was influenced by existing Q/A models in cognitive 
psychology, artificial intelligence, and computational linguistics. We also benefited from available 
empirical data that was collected in the context of short stories (Goldman & Varnhagen, 1986; 
Graesser. 1981; Graesser, Robertson, & Anderson. 1981; Graesser & Clartc, 1985; Graesser & 
lyJurachver. 1985; Trabasso, Stein, & Johnson, 1981; Trabasso et al.. 1988). lengthy fairy tales 
(Graesser, Robertson. Lovelace. & S^inehart, 1980). scripts (Bower, Black, & Turner. 1979: 
Graesser, 1978). and expository text (Graesser, 1981). 
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QUeST: A MQttftf (A Human QuBstion Answaring 

« is Gonyeniem to se$mito QUEST Wo four major componente (see figure 1). First QUEST 
fransJ«e« the cp^k)n m a iogteai form and assigns It to one of several question eateries. 
Second. QUEST identfftos ttie kifomwtion SHirces that are relevant to the question. Informatton 
sources are reprinted as oonceptuai graph mnjctures that contain goal/^ t^rarcWes, causal 
netwoite, laxonomfc hlwrarctes. and desc^iptiye stwctures. Thifti, convergence mechanisms 
con^Jirte the subset of nodes in the Womiatton soufcas that serve as rolevar^ answers to a 
particular c^iestion. These oonveigenra mechan^ nanow the node space from hundrecte of 
nodes In the infonnatton sources to less tt«ui 1 0 answers to a particular question. Fourth QUEST 
consldeiB pragmatic features of the communicative Interaclion, such as the ^als and common 
ground of the speech partlcljMnts. Although the process of t^estton answering is segregated 
STto these tour components, we ^Anowledge that an adequate Of A model would integrate these 
conponents In a highly inter^ive fashion (Dyer, 1983; Lehnert et ai.. 1983; Robertson. Biacic & 
Lehneit, 1985). 

QUEST was not developed to account for the linguists features of question answering. QUEST 
does not explain the process of parsing the question syntactically and the process of artteulaiing 
repi»s linguistically. Instead, QUEST was devetaped to account for the conceptual osnient of the 
ar^were. 

Question cateporbatinn 

QUEST assumes that there is a finite set of question categories, that each question category has 
a urigje question answering procedure, and that a particular question is assigned to one of the 
quests>n categories {see also Lehnert, 1978). For example. How are atoms split? is a "t^w-evenf 
question which generates causal ar^ecedenis to the event "atoms are split* A "why-action" 
c^eston Inv8es reasons and motives for intenttonal actior^, e.g.. Whv does a person buy a 
mmputer? - a "how-action" question elicits the plan, procedure, arxJ style of executing an 
intentional action. A tenrrporar question elicits the value of the time argument wHhin an event 
doK^r^tion. QUEST essentially has a catalogue of question categories. Any given questton is 
typically assigned to only one of the question categories. 

In order to complete question categorization successfully, it is necessary to detennine the node 
that constitutes the question's locus" (i.e.. the queried rwde). Several alternative mies may 
sen/e as the question focus in any given questton. In the Question How is water heat^rt? tha 
question focus is the event Vater is heated." This event is a "statement node" which contains a 
predicate (X heat Y) and an argumem (water). Statement nodes are similar to the proposition units 
which have been frequently adopted as units of representation in the cognitive sciences 
{Anderson, 1983; Khtsch, 1974; Nonnan & Rumelhart. 1975). Sometimes the questton focus is 
an argument of a statemem node rather than the statement as a whole. :n the question What 
ftgatgd the wafftf? the questton focus would be X in the statement node "X heated the water." 

Some questtons are very tong-winded and involve many aatement nodes; "How ctoes a nuclear 
power plant In southern CaHfomia produce electrictty when there fe a bladtout?" in such 
questtor^, the focusing mechanfem determines which of the attemative nodes is the questton 
focus. The focusir^ mechanism is complex, with semantic, conceptual, and pragmatic constraints 
exerting their influences. QUEST does not ojrrently explain the operatton of the focusing 
mechanism; it merely acknowledges this component and assumes that focusing is successfully 
completed. 
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.Sources and Knftwi«»rfqft R eofflSAntatin^ 



^!?«!Sir^.fS"Sf.?c?T'*"^"'*^ ansvyrers to a question. Whenever a 

question ts asked. QUEST cpmpuies an expression with three slots. 

QUESTION (<Q<ateflory>.<Q'focus>,<infonnation sources>) 
The expression for How ts watBr yiftfltt^ft? would tw: 

QUESTION (how-event, water is heated. <information sources*) 

stot supplies «ie world knowledoe simctures that are tapped for answers to the 
questton At least one infonnation source must be accessed Isefore tf^ question can be 
con^jtetely Interpreted and answered. Without an Information source, ft is difficult Bnot 
impossa^. to understand the question and to aentffy the focus. 

When mom t^esttons are answered, several information sources are relevant to the question 
There are episodic knowlec^ stniciures- {EKSs) that conespond to particular epi^s that a 
person expenenced in the past. For example, the answerer m^ have viewed a f flro on nudea-- 
PMw one day. read an artlcte on another day. ami h«l a conversation about nuctear power with a 
wend on yet aiott^er day. These three ei^ertonces wou^ create three EKSs in tong-tenn 
niefn^ In addition to a large Inventory of EKSs stwied In memory, there are "generto 
kn(Mrie(^ stnjctures" (GKSs), A GKS a more attract t^preser^ation that summarizes the 
typical properties of the contef« it represents. For example, there are three GKSs triggered by 
L^*?f ITS oS;fl« NUCLEAR-POWER, WATER, and HEAT. The content of the GKS for 
ri^S . i'^^^!'^ "^^^ probably Include the statement nodes In Figure 2. The content of this 
SmI^"-!?'"^*®^^ '"^"^ ^^"^^ 0' ^Ss that are associated with the GKS. However 
UUEST does not offer any infonnallvB or controversial claims about such relationships QUEST ' 
merely assumes that the cognitive system is a vast storehouse of EKSs and GKSs and that these 
structtires furnish the infonnation sources for questtons. Generally speaking, it Is easier to access 
and to search through GKSs than EKSs: GKSs are veiy familiar knowledge packages that 
sometimes are products of thousands of experiences. 

Many of the Inforniation sources for a question are accessed by the content words in the 
c^eslion, such as nouns, main verbs, and adjectives. Information sources tnat are accessed by 
content words are called Vord-activaled" information sources. In con:, jst. "pattem-acth^ated" 
information sources are activated by the context of the question and by combinattons of content 
words. 



The tf^omiation sources for a particular question consist of a family of GKSs and EKSs (see Figure 
3). Each Infonnation cource is a stmctured database with dozens/hundreds of nodes. It follows 
that there is a wealth of information available in working memoiy when a question is answered If 
there were four irtfonnation sources and each s-urce had 50 nodes, then 250 nodes would be 
available. Clearly, most of these nodes would not be produced as answers to the question Only 
a smafl subset of nodes {less than 1 0) wou W be produced as answers when adults are asked 
questions. Convergence mechanisms specify how QUEST begins with 250 possible answere in 
the node space and converges on approximately 10 good answers. 

Graesser and Cla/k (1985) identified those infonnatJon sources that are particularly prolific when 
adults answer questions about episodes in short stories. They reported that word-adivated GKSs 
fumished apf^oximateiy 72% of the answers whereas pattem-anivaied GKSs accounted for a 
modest Increment of 8% additional answers. GKSs associated with main verbs in the question 
were more important informaiton sources than were GKSs associated w th nouns. Of course, it is 
important to acknowledge that these findings may only hold up for simpij t\ones. 
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to some debate over tt)0 mta^ {»mr«x4tom d the ttxtbam (an 
krwwiedgesttuctures when c^ie^ons are anm^^ Acootdino to Redei^ 

mo(M of que^ answering (fMf, 1987), kxIMd^ 
generto Itfwwaclge th^ decide whether 1 partioriar semence 
they decide whethtf a M sentence h«3(^ had not been pieeentodearfler. As Rodef»ticul«e8 
it. wtttnener verification ^idi^nents and leoof^n ^idj^nenta are macte, the person can ^er 

access a spednc memoiy (corresponc^ to the textt>ase. an BCS) or the persmi can rety 
piausiNlity judf^nts (derived from GKSs). As the de^ betv^ text con^srehension and 
question answering ino^^es. we rely more on generic loiowfedga tiecause the textb^ is iess 
acGessK>ie from memoiy (Grae^ & Naiwnufa. 19K2: Kinlsch. 1988; Rector, 1987). When we 
are unfamMtor with the t«^ dhscussed in the text, we refy prhnari^ on the texthase becsnse lew If 
any GKSs sg9 aMsMM. 

Concgplual Qfaph laructufw. mom k^>rmation sources comam a set of units calied "statement 
nodes." Aooordbig to the represeiititional sy^m adtH^fed by QUEST, the matemeri nodes in 
any given infomiaticm source are oii^euiized in tf^torm of a concspbial^Bphstn^ Thtiis, 
the set of nodes are assist to node calories WKf «e Mnidured by a network of direct 
reiationai an». The node c^egor^ ^ic^ state, event, goal, action, and myie «»ciflratlon. A 

is an ongoing charaii^i^wlUch remains unchang^tfmighout the course of the time 
frame under consideratic^i (e.g., nod^ 1 ,2, and 3 in Figure 2). An smA is a state change within 
the time frame (e.g.. nodes 4-10 in f'H^ 2). Agoal refers to m\ event, state, or styie spedflcaflon 
that an agent des^(e.g.. a perKnwwits to buy a computer, a peiwn wants to be rich). An 
d£lisU2 an adiieved goai, such mat me ^ent cSd sometf^ that <^ised the succe^ 
outc»me). A s&dft ^jecMiciaion conveys the speed, Intensity, force, or quaiitative manner In which 
an event untoifb {e.g., an event occurs quJddy. m circles, quietly), in principle, is possft>ie to 
include additionai node categories m QUEST, but me above five categories were sufficient tor the 
issues addressed in th^ pro^. 

There are several categories arce in the representationai system adof^ by QUEST. For the 
mo^ pan. we «to^ed the arc ca^rtos reported in Graesser and Ciarft (1985). Thearc 
categories in any given infonnation source depend on type of icnowlei^ ctopicted in that 
infomriation source. For example. Consequence (C) ans are quite prevalent in causal networtts 
(see Figure 2). Goal hierart^ coi^bi the following arc categories: Consecpience (0). impUes 
(Im), Reason (R), Iniate (I), Outeome (O). and Maw (M) arcs. Each an: category is ejected, 
such that me end node is connected to the head the arc and the source node is connected to 
the tail: 

(source node) — ARC— > (end nocte) 

Table 1 presents the nJiesf^oompositiDn tor these six arc categories. A complete description of 
this representationai system is beyond the scope of this report (see (Cesser & Clark. 1985). The 
examples in this report should convey the Important character^ics of the representational 
system, 

There are senMimic and conceptuai constraints that must be satisfied before two nodes can be 
connected by an arc of a panicularcatogoiy. For example, there are %ee oonstr^s issociated 
with the Corisequonce ms wtrich are prevalent in Figure 2. Consequw .oe am cm\ rel^e 
evem/state/styie nodes but not goal nodes. The source node mi^ have occurred or ex^ed in 
time prior to the end node. That is. the catoe mu$t precede the effect. The soume node nust 
play some causal mie in producMig the end node rTrabasso^ at.. 1988). Thatte, MthesHirce 
node is negated or removed, then the end node would never occur. Two nodes cannot be 
related by a Consequence arc if any one of these mree oonstraii^s are violated. 
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(Maiw & Thompson. 1986: Meyer, 19^. and conc&piua gn^ (Sowa. 1983). 

^jgi'""^ - ^»»v9(tevotedfnMt of our iweaich efforts on tour t«»e8 of 
fr^JSJl .^*^f ^^'^^^'^^ Second. floalWerafi^s convey me pteTOaJ 

i^^!i5!L*iT^^"**^*^^*>- Thw.ta)conon*werarttf»iesipec%t^ 
pa^wnlM si*edfy flow reoions are en 

teS^2S27/J*^^ WeacknowtedgeZTf^^ 
S,Si^^iS!.Sl!l**?i52f^ ^ Of 8e9re{HAbig these types of 

SSSs tL^S^TL"!^^ characierfetlcs <^ both the stnicaSS aSrf tt^ <5a 
proceouies mat operate on the structures. 

h!!™^?^*?!f'***"®°"*'*®***^*i^ GraesserandFrmWftKinDr^) 
has a more detailed descrif^n of QUEST In the context of all four t^s SS. 

Converoencft Mochank^ 

^^^^ "^^^ kilomaS^ sources In working 

nwwnfo^ each infor^^ 

SrS>^ZI^«' ilSLS;;^'! T ^^'^ ^ W««lmately 1 0 good 

answers. Convergence is acoonpHshed by ttiree componems: (1) an imersect^ nodeSrSiffer 
(2) an arc search procsduie. and (3) oonstniim sat^acNon. '^^"wcw^ nooe wenuner. 

I ntfirsflctlng nnrifla a tid stntftfi i fBl amm . An intersecting node idemifjer isoiates those 

P'^*^' may be stored in 'n^ Sion 

these Mersecting nodes hav, a higher 
Jjewiooch^beino produced m answers ttwn do nonMersect^ nodes (Goldlng, GraesseT & 

^nadditton.nonlntefseSlngnodUhav?i^^^ 
intersecting node. Thelilcelhoodof anode being produced as an answer decm^ 
& Clarfi. 1985: Graesser & HempMM. 1990; O^aesser. i^n^w, & Brakierd, 1989). 

Wersecting nodes provides one convergence met^anlsm but does not go the 
d«tance In reducing the node spa. .frw^ TfiarTseSc?; 

tttefenpomwittonote 

that tt^reported^jr^ of stnjctural distance on ^iswer production does partlal^SwSrfaS 
from the arc search piocedure and constraint saflrtaction. wiwiouiwis 

The priced effects of ^ructuml distance on answer product and answer qualSy would be 
SSiZ ^ quemion answering other man QUEST (Shanon. 1983; Wbiston, 

.ii ^*^"^^***«^^«Q<Afno^Jel that answers why and how questfo 
goai/pian hierarchies and problem spaces. Hte Q^A algorithm spectfles tha good oiswers are only 
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one arc away fmm me queried node. 11i90fie8(^niai«HH'pBS$Mioffiid^>rea(inaa(^^ 
(Andeieon. 1983) would atso predict the (Sstwice ^»lient. 

AmSftamhffWBmtim - Eacti qu^lon csaegofy tts own arc seanch piocedure thctt operates on 
tttefntormatfonsoufcesnftle^nttoai^je^. Wwn a gh^ Momttion source is messed, the 
arc saaichf»iK»AJie firsi icientlfies an "entfy node' Inthe bifonn^ton source. The emiy nocte 
usuaiiym^chestheqii^ionfticus. Fey eKaitoie. if the auction is How tewat^ha^i^ ftnrttho 

iftfoniiation6ouiceisFigum2.thenevem6wouldt)etheemfynodeinthestnKmjre^ In some 
cases, the entiy node does not match the question focus; instead R nMies an tntersecttoig 
node tjetween two Wonnation sources. For example, the node "energy to release* n«aht he 
stored in itw QKS tor NIKILEaJ^^v/er and tTw QKS N>r HEAT, "mesa WersectInQ nodes 

would sewe iffl entty nodes even though they do not the fflj^km focus O^r fe 
heated"). 

Once an emiy node is located in an (nfem^ion source, the arc search procedure executes a 
bfeadih-ffr« sean.^ from the er% node Ijy pursuino legal »C8 that faifi«e fnwn ^ 

For each question c«egmy, there is a pfflticu^ set of «c ctiegorles and arc dkedions that are 
legal. Figure 6 shows the legal paths for queried ever«s. Some (^mstlon categories pursue 
causal amcB6»m on pmhs of backward Consequence am (namely the why. how, «^iwi, and 
wiabie qitfstior^ whereas mhwc^e^icMi categories pursue caisal consequences via fotwani 
Consequence arcs (mGimly consequence and wt^»tf questions; ConjMder me question ^mj& 
water hgjHBffl? in tt» context Of Figure 2. i.e(^ answers would be nodes 1.2. 4, »id 5 whereas 

Uiegalar«werswouidbenode8 3.7.6.9.andl0. The legal mswers would be entirely d^emr« 
for the question What are the conseouantMs of waar hetn^ hi»iiti>ri7> noc^ 7-10 but not noctos 

1-5. T,ie fact that yiegalp«l» are pnmed from considera«on8iMJSt»itfe% narrows d^ 
^)ace. 

Con^ knowledge stmctures have a more t^mna distribuaon <a arc crtegwies than merely 
Conset^ence ms. A ccmipiete specificatiwi of a (ausai ffl«ecedeni patfh cons^ of any 
con*inat*onc^mef(«owlngarcs: impfles(foraanj or b«;kw^.baci(w^ Outcome. bs£k«^ 
initiate, and baciward Consequence. A causa consequence pam omsists of any cpr?^naiion of 
the foSowir^ arcs: Implies, fonvard lr«late, fonvanl Outoome, and forward Coiseci^ence. A 
complete accourt of the legal pahs for cflfferert question crtegwles is provfeW in preview 
pubiicatk^is (Qraesser & dark. 1985; Qraesser & FiwHOin. ^ press; Graem & Murachver, 1985). 

Goal hieraichies (see Figure 4) cental a set of god nodes thia ae toitofTeiated by Reason aics. 
Goal hierafchiesfrequeftfy have Marmer arcs and omerchmrtetMics but we wdiignc^ ti^ for 
themomem. Supemrtfinaiegoiris are at thd^ of the goalt^rart^ whereas tow-le'>el 
subordinate goals and actions are at the bottom of me f^ichy. Figure 6 shows the arc search 
procedures when a goal (or acaon) node Is pn^)edwfth a que^. Answers lo why and 
cor^equence (abbreviated as CONS) qMstlons puim »jperci<^e goals via fbnvard Reason 
arcs. Ar^swers to how, and enM questbns tend ^ pursue si^risnate goals that ladiate 
from thf entry node on p^(^backwffidRea8(m arcs. For exiMWJie. oon^r the guegtton vwy 
dMByicaHifflontriflHriflPhQnaZ Legal goal mvers would be nodes 1 and 2 (in order k> feel 
benf».-.inordertot^to^)whefeMiBegaigoal««werewouMbenode83. 5.and8(^crJ»t 
go to a bar, Conifer to wale to a couch, m Older ft) {^Jtfs number). Tlw legal goal answem would 
beentfrehrdilferertfortheQuestenHowi«dB«eflBjmrinthfttiik^^^ The legal answem 

would be nodes 5 and 6 (BW wasted to a couch. BM cfiaied Jil^ number) but not nodes 1 , 2. and 3^ 
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GoalH>rt©rted Knwrt«jg» Is i»rnw«y more 

«»iWOTn«qt«j by f^ason ms (Mflter, (Planter. & Prajram, i960; NdweA & Simon 1972- 
Scivm. SHdharan. & Qoodson. 1978: Witensky. 1 983). 

(1) ^<^floalnoctet aw frequency patSu^ In the fomicrffri^ (Schank& 

ADMIfNIt 1977)* 

(2) Some goal nodes aia amnoasd by Mannar e«» ra«w it^^ 

(3) Go* nojJes are nomiafly irlogefed 1^ siatas/eventt bi the 

Ml i^^lSL^' ^ ^ hterarchy in Figure 4 (nodes 1-6). 

(4) SibSnp nodes fri a goat hIefatcHy are reteted t>v add^kw^ «ca fan>^ or betow) . sihiing 

no^are^nmscfetelydomkiMt>ythe8«neparmgoa^^ rorexanY>ie.no(tes 3 
and 4 hi R^jre 4 would b9 raiaM by flc because ttie achtevemew <^ either or» 

these goais would and i4>en^«no the parent node. Nodes S and 6 wouJcl be related 
by a arc because goal 5 would need 10 be acttoved prior to goal 6. 
{5)GoalnodesniayorniayrwtbeacWevidwhena9oalhtorfflchyisfcistan^ Whena 
goal node Is achieved, an event/st^e is constnifled and Snked to the goal nocte 
an Outcome are. For example. If Bimm^ to c^JjJTsrHJffiber, men thera would 
be an avert no(te (Bffl dialed Jiffs number) In ade^ to lha goal r»de (OB waited to 
dlalJWsnun*er). AninienUonaiactioni5antfnalgemaiionofagoal^itsoi>toonie 
node. When a goal Is ncrtac^i^vad. there etther Is no goal notto or a negative 
outcome node (e.g., BM did not dial Ms r^mber). 

Given that goal hierarchies are r«heroon^x 8tri«ui«8. the arc saari^ proceduw 
actions are more complex than the procedures depictad in FH^re 6. R}r example, answers to why 
que^lons IncHide (a) superonMn^ goals ^ the goal hlwarchy . (b) sibitoig nodes that precede the 
entry node^ «ates/8vent8 that ir^tiito the goals, and (tO causal antecedents to the goal 
initiators. Once i^ain, a more con^jlete spedficmion o< the arc search procedures ara provided In 
previously pubHshed stuc»es (Graesw a ClarK 1985; C^a^r & Frankyn, in press; Graesser & 
Murachver. 1965). 

We have written a oon^juier program that generates legal arswers to many dWerent types of 
questions ih^ may be asted in the owttext of causal networks, goal hieraichies. taxonomlc 
stnjcturss, and sp^iat parlonomies. The user specifies one or mere infonnation sources and then 
enters the que^ton. The computer fets all answers that would pass tt» arc search procedure 
associated with the quei^categcHy. The f»ogramte written ^ConronUSf. Ithasbeen 
implemented on a mterocoff^juler (i^ done), a LISP mai^ (Texas Instiument EXPLORER II), 
and a parallel computer (INia hypercube wWi 16 paraUei systems). We refer to these 
implemer«ations as mtoioQUEST, QUEST, and hyperOJEST, respectively. QUEST and 
hyperQUEST are needed whenever muRif^e Informatton sources are ratevant to a t^siton. 

The arc search procedures m QUEST are compatible with some theories of qpiestion answering In 
artffkjial intefflgenca which have enphasfeed the Importance of knowtedge orgar^ttion and of 
restricting search by pursuing partk»iar concepbral relations (Dtftlgren. 1988; Lehnert, 1978; 
Lehnert et si„ 1983; Schank & Abelson. 1977; Souther et al.. 1^; Winston. 1984). 

Constraint gatlafacfign The semantic and conceptual confers of the ar^wer should not be 
incompatible with the content oi the (^ried rKide. ConstraoH satisf actk>n dtecards those 
candkJate answers hi the node ^Mce which are incompai«>le with the focus slot of the c^estion. 
Stated differently, the t^iestk^n focus has senwntte and conceptual constraints that are 
pn^pagaiwS among nodes In the node space, ultimo ely paining out the incompatlbie nodes. 
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TTYore wv several ways in which a cancikta^ 
focus. Oimefiskm8 0finooiii|»HI)Mly^ 

(19^ Theex»ni^8nnml»lowMu«f]Biatwodthss8cB^^ 
<»JB^ focus ]Katfld&llfiatel and a ni«^ poiw pte^ 

(2) Waiflf fe§ from tta sKHHft. Thisnodekivoiv^a'^frm*ino(»ip^biMy^^ 
raMafi for a Qh/sn w««' moieoute is enttely outside of 
walar nvisGule be^ heM in a nuciear power 

Comia<ficiion and time fmtm irwomp^i% 

conned cM^constnyntsatisfac^ '^iare«igincompMainte»' occur whenever a plan 

conveyed b) a cancfid^ answer (e^y.. person saves nioney) inoompatisfe iMtli 

conveyed in me <vj«ied node (e.(^,pen»ntiuy8compitei). "Causti stien^ is the extent io 

which the candidate answer is caustic leiated to the queried node: a cffii^^ 

pnmed out if Is not cffiffianyrelatad^ the queried node. "Aigument ovarii^ computes whether 

th8twonocte(candiitatemwer»idqw^nocte}shamoneornmconim(maroun»nte. A 

"(Nffijsmy- dimension specifies w^i^ter the cancfidate an^ 

general world knowtedge; bnpiaustt^ nodM in t»« imowiedge base wcHJid 

A sbnpto (»nipi^ion of oonstraini 8ati^a^(»i would ev^lua^ eac^ candidate 
dimensions: oontr^fictitm, tftne franw bicoinpatibWty, piara^ tonipmS)itty. c«»ai strength, 
ari^mem ovrntep. and piausliiflty. Acooiding to a strict cftteiion, a cancfidate node panes 
constraint saUsfactioitfaa of ttWGttnenslonswesmisftod. Acooiding to a weaicciterion, anode 
passes if most dimensions are satisfied or I niost dfenensions are satisfied to some degree. 

intflfactioftt anma oomponflnts of ctMiv«ti(»ftncfl The three components of convergence (node 
intersection & stnjctural chance, arc search procedure, arid constraM sattef action) are able to 
nanowthe^MceofcanSdatenodMtoappiDidmateiyiOgoodanswerstoaque^i^ Given the 
adequacy of these oonveigence mechai^sms. one n^ tf^ coralder how me thr«e 
components Interact For example, one posftkm might be th^ the three components operate 
independentfy and ad(«iveiy. If so, mere should be main effems but no MeracNoitt In analyses 
of answer quaMy and in analyses of answer pitxiuctionsoom. ASemativsiy. tt» three 
componer^ n^ be executed intmactiveiy. Of course, tt^ iimer posttioii would not be 
paiiicularly infonnative xmas S predicted systennrtic output. 

One obvious posSion to consider is ttm the components are executed In sequential order, wWi 
operation of component N bemg dependent on the output from comporwm N-1 . For example, 
suppose that the process sequence below was correct. 

Stage 1. Find alMersecting nodes among the infomiation sources. 

Stage 2. For each infbmiation source, nndwi^my node (e g,, convsponcttng to the 
question tocus) ffixlrandon^ sample a candidate answer node. • 

Staged. Apply the aic search procedure to detormkw whether the candidate answer is 
onalegiripathex^idbigfiDmthe«trynode. If me answer is iOe^ then stop and 
decide that the canc^e node is a bad answer. 

Stage4. Apply the oonstmim satisfaction component to chedc wheth^^ the candkfate 
answer node is OMnpatlMewtth the entry node. If tenet compatlbie. then stop and 
decide that the cffix^e node te a bad answer. 
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aw 5. CoiT|H4»tlte8tni(»jfaliJislancebeh(wenttiecandi*^ 
noda. w the (lstafx» score tehij^. then (tectte«ii!flTec«Tc^^ 
9nsmr. »♦ .cfiatencesowtekM.mendedilethatthecfflicfidatenodeteaQo^ 



The stotm Mquential process^ mechanism would predict some WeractkHis. Fim. there would 
te a2^«ay inteiBCtion b^ween arc seoi^ and coratf^ 

no! reach stage 4). Second, th^ would be a 3-way imefa«lon among arc search, cor^int 
satisfaction, and stmctural mance; stnjctural (^stance would influence answer qu^ scores only 
for ieg^^tswers th« safi^ OK^rafnts {l.e.. nodes that reach stage 5). Decision latencies would 
generaHy be longer for legal than i&egal answeis becaise legal answere would pass through more 

St3Q0S» 

At this poW. me QUEST model does not spet^ particular Interactions among converaence 
wmpoiwnls. There simply is nrtenou^empwcal data to make any defWtlvecteiinis. However, 
the dudles ki this report do peiiodicalb^ provide some data thai are relevant to thfe issue. 

Praomattei 



The f»agn^ components ^ttress the social and conminicmive tundions of ans^ 
<lue«ion. CXie component consWers the goila of the questioner and answerer. From the 
peigpective <^ the wsitoner, a question may be asked fti Older to acquire Wonnalton. to solve a 
prot^. to assess how much the wiswerer knows, to persuade, to control a conversation, and so 
on. From the perspectlvod the answerer, the answer may be fermulaisd to irtfonn the 
que^ner, to let the questioner know the answwer knows somethtoig, to entertain the 
questioner, and so on. AcompletemodelofQ/Awouid consider the goals of the speech 
parlic^arts in a partkaiiarcflsoourse context and woukidetemibie howfiie oism^ are tailored to 
achieve these goals (Aflen, 1983: Appell, IMS; Bnjce. 1982: FrantHk & Clak. 1985; Kaplan, 



One Important goal to assess is whether me c^iestioner genuinely seeks ttw Infomiatlon 
suggested by the quemion. Some questions are m gemne Wonnatlon seek^ queslior^: 
Indirect requests (e.g., Wouid you pass the salt?), greethgs (How are you dWng?), gr^ (Why 
does thte always happen to me?), and rhtiortcalque^ions. Van der Mel (1987) tesldentWed the 
assun^kwis th^ must be met betore an utterance constitutes a genufr>e Infonnaaon seeking 
question. These assumptkms are Osted betow. 

(1) The questioner does not know the Wonnatfen asked for with the questton. 
(ki) The questkmer beHeves that the presupposittons of the question are toje. 

(3) The QMsttoner believes mat an ffiiswer exists. 

(4) The questimrwante to know the answer. 

(5) The questioner can assess whether a reply constitutes an answer. 

(6) The questtoner poses th* queirtton orrty » the benefSs exceed the costs. For 
example, the benef Ss oi knowing the iv^wer mum exceed the costs of asking the 
question. 

(7) The queifttonerbeSeves that the answerer knows the arawer. 

(8) The questioner believes that the answerer wW not an answer in absence of the 
question. 

(9) The {^jestioner beNeves that the answerer wU supply an answer. 
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A second pragmatic component is the common ^outid (i.e.. shared knowiedge, mutual 
Imowiedpe) betwem question^' and ansMwrer (Ciartc & M^shtf , 1981 : IMiyito & Nonnan, 1979; 
Shanon. 1963: ^eeman A Brown. 19K). Acconing to these modais, the answerar first 
estimates the common gnsund between speech paftic4>ams and then seiects an answer tt^ 
moderately extends the boundaries of ttw common ground. That Is. the ar^awer shmiid be 
somewhm more ^fornvrthre, elaborate, or detailed than the common ground, but should not be 
(a) entirely wW*) the Sf^iers of the common ground or (b) substantially more detailed than the 
common ground. 

in principle, the QUEST model is able to ke^ irmk of the common ground between questioner 
and ai^werer. QUEST would evaluate what feifomiation souroes the questioner has ^red in 
menioiy and what nodes ^qiMStioner has ^red in e«:hinfomi^ion source. Thefringeor 
boundsy of knowtedge can also be computed^ a sttalghtft>jw«d manner, inpankajiar. afringe 
answer would be few aics away from a node In the common ground. 

Ck)mmon ^ound CMid have some ccHimerWtive ^ecss on answer c^M^. n the oonimon 
ground is high and the ar^werer wants to be infonn^lve (1.8.. supplying infonnation that the 
questioner m know). th«i the answerer wouM avokl nodes that »e In nultiple Womiation 
sources. Surprteingiy, there would be a negative conreialion b^ween answer quaity and 
of Hiformation souro^. Re^n^^mctural distance and common ground, thenin^ be a 
preference forcSstant nodM became proximitte nodes would be easy to infer. Perhaps a 
ojnriiinear relationsf^ would occur, wim answers at Imiennecfttte chances b^ better than 
answers at close and ^ far ^stances from the eitiry node bi an infonnation source. 

This section has ctoscrfbed QUEST and tms identMed its theoretical kwndatlons. One of the 
oi^ecthres of the ONR oomract wffi to assess the extern to which QUESTS cofTponenta can 
ex^in empirical d«a hi que^ton answering tasks. The sUmSies reported hi me next section were 
conpleted under the ONR contract in order to address thteob|e(^. The resuRs of these 
studies wers quite promts^. Therefore, we conclude that QUEST Is a piausibie p$y(^logicai 
model of human question answering. 



Tflsts Of thfl QUEST Moael 

QUEST was tested in four dBfersnt infonnational contexto. These contexts inckided (1) 
expository texts on scientWc mec^iar^sms, (2) narrates texts, (3) generic knowMge stoictures 
(e.g.. objecte. person ooncepla. scripts), and (4) situ^lons with complex pragmatic constraints 
(i.e., telephone sun/eys, butinsss tiansactkMis, and telexed interviewsi. In some experiments, 
we examined answer prodLcJon scores, that ^. the likeWx)Ob that a node in an mtonnation source 
was produced as an answer to a question. In (^h«'exp«tiiien&, we exmnined answer quatty 

scores tor question-answer pairs. I.e., whetf»r the answer is a good versus a bad answer to the 
question. 

StiiriiftS of Fxnositoiv Taxts on SetonHfip. Mofihanfem« 

Two ^udies foosed on short texts that describe event cUm^ in phystoal. bl(Nogk»i. and 
technological systems (Qraesser & Hemphi. 19^; Graesser. HemphiH, & Brainefd. 1989). Each 
text had five events, as Mustntted in the text betowon nudear power. 

1. Atoms are spot if^ paiUdes. 

2. Heat energy is r^rased. 

3. Water in the surrounc&ig tank is heated. 

4. Steam drives a series of turbtoies. 

5. TTre turbines produce electricity. 
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^?^f'JS?i!^'^^f*CS ^^^"^ ^r^^ ^ these sciemific mechanisms 
!2; ''"^ ^ ^^""^ as an infonnation souice. Moreover, we started out 

C^r^^^"^ ^ ^^"^ "^"^^^ ^ * ''^^^ oonnectedby 

(El J {E2J .-C~> [E3J -C-> IE4J -C-> [ESI 

lI^t^'^TilJl!^**! assumi^n because reactors sometimes imposed a goal- 

^l^jJ^^^ intejpret Jton on the event sequences in the case of technotogteal and 
^l^c^r^^^l^T JI? ^' ^^^'^ ^ P"'T»se of ad^evingsXsequent 

P**^ Of hav«g steam drive a sSies of^r 5s 
n rZ/ ui^' »p9*!«e« 0* a ""Clear power plan: would design the plant with such goals 
m mind VWwnever a teleological inteipretation was hi^x>sed on the Text, the textbase arsSed 
oiagoalstnictureiufmlngparatelwtth^casajchaln(seeFigure7). °^«^ris^eo 

We t^edQUEST examining the ansvjrers to five c^estion categories; why. how. when 
e'. Ne, and conset^ence (CONS). For example, evert 3 would he probed with the toHowJlna 
Qt {K>ns.' ^ 

Why is water he^ed? 

How water h^ed? 

When water heated? 

What enabled water U) be t^ated? 

What are the con8e(;Mences of water beerg heated? 

Giventhat ea:h text had 5 events and that there were 5 que^n categories. 25 unique 

Slln^^'!!!lf^^^^ Although subjects occasionally generated irterences 

I^S^ Jje^iore. we analyzed only those atmmn that refened to events 

^S^r, ^ ^^'e^' concerned with four possible answers to each 

question For examf^. the querfed node in tl^ above questions is event 3: we analyzed those 
answers that re*ened to everts 1 , 2, 4. and 5. 

These studies on expository text were designed to t^ the three components of the 
^^"f^S^Hf^^' ^^^^ d^ance. the arc seari^ procedures, and oonstrairt 
s^isfortton We tourd robust support for stmcturel d^ance aid itm arc search components but 
not for constfa.rt s^isfaction. "merefore, we m concentrate prftnartly on the two successful 
componerts; analysis of constrairt S£disf«:tion vifOI be saved for the end (rf this sut»ection. 

Fijire 6 shows the arc search procedures the QUEST model. Le^' answers to how enable 
and when questions are causal artecedente to the queried evert. If evert 3 were probed with ' 
nese types of questions, then legal answers would be everts 1 and 2 txjt not everts 4 and 5 
Legal answers to CONS que^ions are causal consec^ences (evertfs 4 and 5 but ntH everts 1 
and 2). Legal answers to why questior^ depend on whether a ^aJ stmcture is superinposed on 
tne causal Cham. »nrt, then legal answers to why questions are the same as answers to how 
enable, and when questions, if a goal stnicture is superftirposed, however, then legal ar^wers 
include causal consequences but not causal artecedents. 

The stnjctural d^ance componert predicts that proxhnate answers should be better man distart 
answers. Presumably, stmctural distance has an irtluence on legal arwwers but r« on 
answers. If evert 3 were probed with the t^esttons. then the foltow&w predictions would be 
generated by QUEST. 
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How. enaJjte, when, and whyCnomeleologicaO quest tons 

E2>E1>E4«E5 
CONS and why(teieoto^O que^ions 

E4>E5>E1 mB2 

Given that aN 5 events ^ a text were queried and thjtt tfmra are 4 possible answers per c^jestion, 
there are a total of 20 ceUs in a conpiete "question-arwwer inatrk * We analyzed how weU QUEST 
could account the dependent metres (i.e.. answer pioduction scores and answer quality 
scores) in ttte questton-answer inatnces. A que^ion-answer matrix was prepared tor each text, 
and sep^e matrices were prepared for eadi (^»sticm category. 

JS35i&' The texte were 24 emiA sequences tm %vere extracted from passages in the ^coadcaa 
Academic Encyciopadia. ABtextet^ five events, ss in the example about nudearpoMwr. Eight 
passages were in ttw technolod^ dom^ (computer, television, paper produdion. nuclear 
e:«rgy. elevator, vacuum dearw, water purification, and wine production); ei^ were \n the 
biolofl^l domaNi (heart seeb^. photosynthesis, knee Jeric, mi^, hah- growth, hearing, 
neurons): and 8 were in the iitfiysk^l science domain (tornado, earthquake, fight, rain, sonk: 
boom, r^kie, supernova, and stalagmite). 

A questton-ar^wer matrix w^ prepared for ei^ text and Mch questton category in aB analyses of 
answer quality scores and answer productton scores. Given ttiat there were 24 texts. 5 question 
categories, and 20 ceils que^ton-answer matrix. 2400 scores were inckided in the item 
analyses. 

Answwr pmductjon scores. We collected an^rer production score from 192 undergraduate 
students at Mempito State Uf^versity. The subfecte first redone of the 24 texts and later 
answered 25 queens about ttto text (5 evente x 5 questton categories). The ^ questions wsrb 
randomly presented^ a booklet T^ bianic lines w^ared after each que^n for subjects to 
write down the^ answers. Eight sub^s were randomly assigned to each of the 24 texts. 

The cr. jcal dependent meast^ was the answer produc^n score. Thte was computed the 
proportion of aut^ects (out of 8) who generated a particular ar^wer to a partteular queried event. 
The score associated with a particulffi' question-answer i^ was the basic unit ^ 
analyses. Therefore, attested ^atisdcalsic^k:anceasi>essedvariabiity among items in 9s enor 
temi. but not variabitty wiong si^jjects. AH mmteSfcai tests were perfonned at the p < .05 level. 
(V»'<) wiH not present the exact F-soores ^ this report; instead, we will simply announce wfwther an 
effe::t was versus was not sigr^icani.) 

Figure 8 prasens questkm-»iswer matrices for the five questton c^egories. Each imrbi 
F^eserto answf r production scores as a functk>n of the five serial positions for (^lestimis and the 
five serial posi^ons tor answwB. The scores^ the upper-right half of eat^mittrfxoorre^xHid to 
causal consequences whereas the scores in the bottom-ieft haS of the matrix correspond to 
causal antecedents. 

An inspection of the mem answer production scores conf ^ QUESTS arc search procedures tor 
how, when, enable, and CO^^S c^esttons. Mean scores of c^jsal antecectents were signRbantty 
higher than the scores of causal consequences in the case of how questtons (.21 versus .06), 
when questtons (.36 versus .10). and enable questtons (.26 versjs .08); as predicted, CONS 
(Restore showed the opposite pattern (.08 vereus .36). However, mean scores of the why 
questions were approximately the same tor caimai antecedents and causal oonsequf*na^ (.16 
versus ,17). 



JV 
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SSSf^ earaer^TOwofB to why questions would tap caisai consequences If teleofogicai 
stoi^ires {U.^^ stmctufos) were svpetlmposed onto tf)ec&i»y chains (see FigurB TTiWe 

0^ anjduies to be imposed on technologicai domains and perhmjs 
»f ^ ^^^"^^ ^^"^ domalrtj. -meretore. we performed se^e 

^^J^Ji^SS!!!'^'^''^^^^'^^^^ When wfy questions were asked in the 
S^^^^Si^^' ^JT''* signiflcantfy Ngher tor causal mecm^ than 
ca^ oon^ences (.30 versus .10). Thus, goal stnictures were m conned when mese 

T^^^'r!^^TT^ lnoomrast.meanscoreswereslgntfSmJ^X 
Sf caisal oonsec^ences when why questions were asked about events 

the primary stnjclure In the case of technologkaJ and biological doma^ 

2>^^^''l2!?Jil^?^^ ^^'^^ ^^"^ on answer prodi^tion scores w^ also 

oortHTTOd. This should be apparemvirtTenlrepectinfl the scweslnFiBurB 8. Thesoores 
decieasedas a function of the (^stance (number of arcs) bt^weeti the curled no<te and ttw 
answernode. When aver^ over a« five <^ton categories, the mean an«wrprodu«lon 
spores fbr legal answers si^icaf% decreased m a function of dtetance. .42. .2B, 21 , and .20 at 
distanc^ 1,2. 3. and 4, respectively. Tftis d«»easefft an exponentitf function better than a 
finear func^n. In cor^. ^nictural chance did not have a cons^em si^itflcant effect on iiegai 
n^H^^L.^^ averaging over the Ave questk^ cate^ries, the scores wre .1 2. .06. 04. and 
.07forc«stances(rfi,2,3.and4.rBspectivefy. 

pJISS^S??iJ^^ *^ ^ parameters ctosely ft the answer production scores. 

wteHhood of pursuing a caisal consequence path. Parameter jte the likel»xx)d of traveishig a 
single arc on a path. Theieisflxedparam^nthatcwwistsof the nun*er of arcs between the 
c^erMnxieandtheanswernode. The prsdfced answer pnx&ictlon scores are conputed as 

SSic.!^S^fIlS^»I?22l5*'"'*^*^^'^^ This simple mo<tol accounted for 
89% me variance of the answer pnxiuclion scores. The best f8 value of l w^ .67. the ^stance 
dampenftig parameter. Regarding the causai antecedent parameter (a), the be«-flt value was 
subslantiafly higher In those conditions in wWch antecedents were legal (.63) than &i those 

^^^^Tn^ai^J^^!^??^^:!^'^^^ '^^^- R«8a««n9t»>«caisal consequence 
parameter (g). again the best-fit values were higher when cor^ec^iences were legal ( 51 ) than 

when consequences were Hlegai (.19). 

G<?QdneM-fif -answflr htffcHTffinls . QoodnessH)f-answer(GOA)jiKJgementswerecoliectedonihe 
24 texts and vireie ana^rzsd hi the same way as the answer {deducted scores reported above 
iVtertr^ subjects read a text, th^ were presented a ser^ of qu«ai^^ Oneachof 
these trials the siA^ dedded whrther the answr was a good versus a bad answer to the 
question. ThepattemsofGOAJud^nerts were expected to be similar to those of the answer 
produ«k»i scores even though the t»ks were somewhat cOferent. Whereas the ar^wer 
production task re^^lresthesui^to r^rieve answers from memory, the GOA task places less 
demands on memory and more demands on the ludj^nent of answer quality. QUEST makes 
idential predkitlons across tasks rating tlw arc search and stnjdural distance con|)onents. 

The GC)A task pennWed us to impose some oontfol over the partfcular arc categories pursued 
when the arc search procec^res are executed. This control Is acNeved by spec^big the 
connecth^ that precwies the af«wer. In the case of why-quMtions. causal antecedent paths 
Should be pursued when the answer Is preceded by iMCautt: that to, bMkwarxJ Consequence 
arcs are sampM. In contrast, when the answer oreraded bv in oider tMor tho gflni «tnir»iim 

and fonvaid Reason arcs shouW be sampled, corresponding to caus^ ooi^equences. For 
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exan^, oonsicter ttie foBowbig four answers to the question Whv is watar hftatu^^ in the context 
of the mjdear power text. 

Because atoms are spNI. (causal amece(tent) 

•tnofdsftof atoms to tjespHt (sutsoidinate goal) 

* Because tuibines produce eiectricay. (causal consequence) 
in order for turtJines to proi^ electricity. (superorcHnate goal). 

The answers with a^er^ are Hlegal whereas the answers wfthout astern are legal, acconSng to 



Legal answers to when-questtons are also governed by connectives. If the answer is preceded 
by alifi]:. then legal answers are caisatiuitecedents. If the answer is preceded by b^i^. than 
legal answers are c^^ consequences. If no connective precedes the answer, then legal 
answers are causal ar^ecedents, according to QUEST. That is, the default connective is attflt 
(see Graesser & Frankin, \n prsss: Qraesser & Murachver, 1985). 

In light the expected bnpad of oomiectives on QOA ^Jdgements. this study had nine 
condltior« altogether. In five of the ooncSttons there were no connectives preceding the answer, 
conesponding to the five types of questkins. Time were four conditions wtth connectives 
preceding answers. There were two connective condftions for whv-iais^ng fbecause wmm jn 
anterloW and two for when^esMons fatter versus tseforo^ . 

The subjects were 1 62 undergnKfaiates at MenpNs State University. These sut^ects were 
randomly assigned to the nine que^n cwxfittons. with 18 sidijects per ooncHtion. Half of the 
sutsjects read and were tested on 1 2 of the 24 texts; the other half of tlw st^jects wre assigned 
the other 12 texts. The prei^ation order of the texts was raidomized for each sidsject. 
Associ^ed with each S-evert text was 20 {Afferent question-answer ttems, as deRn^ed in the 
20-cell c^iestkm-answer m«rlx. After the sid^ read a text, the sub^ provkJsd GOA 
K^dgments on a« 20 Hems, which were presented In random order. 

A microcomputer controUed the presentation of \ho passs^. the c^estlon-answer terns, and 
the collection of responses. The Sidjject began each question-answer Ufa! by presslrHj one of the 
keys upon recelvftig a READY si^. A}tera.5sec»nddel^,theque^ioni^aredonthe 
screen. ThesubjectreMf the (^lestionm his own pa» and prised a key when finished. Aftera 
.5 second delay, the questfon cKsappearsd and the answer appeared on ttw soeen. The subject 
provided the GOA judgment by pr^ng one of two response buttons (GOOD versus BAD). 

A GOA score was computed as the proportion <rf obsen^ns in which subjects ^jc^ an 
answer asGOOD. Inailtestsof statieficaisignfficancewewereabietooompi^ aa^stattstfe; 
this is a consenrathfe test M oonskJers both variatNlity among sublet^ »xj vari^^ 
items. As In Figurs 8. we prepared a que$tk>n answer mmrix for eiN^h text, for each subject, and 
for each question category. 

Mean GOA Judgments were in the e4pected d}rectior« when comparing caisai antecectents and 
causal consequences. QOA judgments were signif icaitiy hi^ (or antecettonts than tor 
consequences in the following six oomfitons: why [M versus .29J, why(becau8e) {.52, 27], how 
[.48, .181, when {.53. 211. when{afler) {.67, .13J. and er^M {.63, .23). GOA )udgment8 were 
sgnlf k:ai% tower fpr causal antecedents than for cmisal consequences in the foOowing three 
questton groups: why(in order) {.26. .57). when(before) {.16. .79}. and (JONS { 25. .^J. The 
difference between antecedents and consequences was smafle^ &i the why gm^ [ 19); th^ 
woukl be expected because this is the only group in which legal answers depend 01) he 
knowledge domain (i.e.. phystoal science, biotogical, versus technok>gk»l). 
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^^^^I^^H^!!^^'^ ^ ^ ^^'^i legal 

for distances of 1 . 2. 3. and 4, raspectlvoly (averaging question cmegortes) 77^ 
differences were very suljtfe tor iaesalara«wefs. .27.l9 J7.vef^ c«esones;. ine 

Constr a int satWactton and mmtinte n^m^^fy mm n n The patterns of GOA judgments hi the 
^ve analyses conf frnied QUESrs (»«dlctjor» wtth re^ to the arc search procedure and 

^ <^ Judgments. Tww dimensions were considered In^analyses of cor^tralnt 

i^^S! '^^'^ ^ shared at le^ one noun-argument and If 

there was a high causal strength t>etween the two nodes. 

Trabasso-s anal)^ of causality when we scaled event parts on 
causal^rength (van den Btoek. 1990; Ti^»o & van den Broek. 1985; Tr^»sso et ^1988) 

^ tow; crteria: temporaflty. operatlvfty, mces^, and sufffctency. The 
r^^^S^^^^I^.^ evertt X (the cause) mu« f^ec^ evem Y (the effeci>7time 
The operattjjy crfterion 8t(«es that evem X or the result of evert X niist be ope^ 
IS^w ^ ^ ' «^ ^ Y pass t^ 

evemx to neca^ tor evert Y If evert Y fails to occur when ever^X is negaed AocoiSngto 

r^fn^^^S?^^ ^ ""^^ ^^'^^"S cond8iof«Twe^rt X occuS'and 

TOfm^n^aances i^ Each of tlwse tour cr«erta 

received a vaUje ranging from 0 to 1 whenever two evems were evaUia^ The 
overall causal strength between X and Y was computed accoidlf^ to fomula 1. 

Causal streni^h-r(y{N4SV2 (i) 

T. O. N and S refer to ttie vakjes temporality, operatlvfty, necessity, and sufficiency, 

^frl^^ii J^HiS?!i^ ^^'^ ^^^'^ "^^^ varies Of 0. .5. 

lud^esT^ a high level of agreemert (.70 or more decisions being the same tor any pair of 

We peitormed nwltiple regression 

answer PmiO^ scores (w on GQA jud^nerts) for question-answer psks. First, there was the 

^tmSl.^22S?\'li^l2lr!^ Secondttierewas 
structural distance, the number of arcs betwe^ the quertod node and the answer node. Third. 

there was causal strength, M measured above, which varied from 0 to 1. FOuftti,«wew^ 

overtop). FWh, there was topic faniliaray. Millls (1989) ooBededfOT^afityr^frigs from college 
students wi the24 texts: the v^ famWarly ranged from 1 (very unf amHUr wAh the to^to 6 
(veiyfanw^^tbetopk^. Slxm, mere wMtoiowledgedomatei,wtth(ftjmmy coded vartSSi 
corresponding to the technoto^cai, t^iogical, and physical science domains. 

In analyses of how, when, enable, arid CONS questions, knowledge domain virtually never 
interacted with ^ of ttie other pr -tator variat^, so we wW not report separate rewesston 
analyses on each knowledge domain. Howver, tor why questk^. there were f requwt 
interactions between knowtodge domabi and the arc search procedure, so separate regression 
analyses wifl be reported on tochnotogkai, btolo^. and pf^toal sdence domains. 

Table 2 presents resulte of the nwtlpte regression analyses on answer produd^ lnall7 
regression analyses, the overafi muB^ regresston equatton si(^itftoan^ fxedicted anstwr 
production scores. The mean percentage of variance ejqyiained by the equatton was 32% (whe 
weighting the five qi^ton categories equally). Standardized beta-weights are presorted for 
each predkSor variable, along with an indfcation of whether the predfclor had a signifkart 
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sen^wrtel oorreialton. Ay muttpto rsgression anafysos showed sij^tean! effects of arc search 
{mean fsma - .44) and stnic&iral trance (rn«in heta - -24). Cmtsai s^rengfih was signtficant m 
only 1 out of 7 analyses (mean beta » .04) and aroument overiap was signifteant &i 2 of the 7 
analyses (mean beta - .05). hi 4 of the 7 analyses, the answer pioductlon scores ctecreased 
significantly as a function of topic f amiiiaiKy (mean beta « -.08). 

m a set of followH^ recession analyses we added Wera^ton terms for three compor^nts of the 
convergence nwchantem, rHuneiy arc search (A), stnjctural (^stance (D), and causal strength (C). 
There ware four pos^ interadion terms: AxC. AxD, CxD. and AxCxD. The incision of the 
imeraction tefflislna«ased the anwrt of explafaied variance ff^ Ttwcausai 
strength variable rarely intef«:ted slgntonMy with the other two convergence oon^)onents: the 
AxC, DxC. and AxCxO kiteractions were each sif^iifteani In only 1 out of 7 mult^ regression 
analyses. The AxD inaction was the only robust and con^nt imeraction (6 out of 7 
analyses). Thte AxD hmctlon reflected the pattern of data repotted eartler. That Is, structural 
distance had a large impact on legad answers but a very mpact on answeis. 

Mutt^ recpes^ analyses were also perfomwd on tfw GOA MJ^^nertts, using the same 
statistical procedures that were used on me answer pitx&ttifon scores. All multipie regression 
analyses signiflcantly predicted the GOA judgnwms; the mean percentage of variance explained 
by the mutti^ regression equations w^ 57% (varying from 30% to 65%). Table 3 presents beta- 
weight for arc search, »nj(^i^ distance. «id causal strength. The arc search variable was 
significani m aO IS analyses (mean beta « .57) Mrhereas stnictural ^stance was si^»icar% 
negative fri 14 oifl of 15 analyse (mean beta - -.36). strength was sigNiWcantin only 2 out 
of IS analyses (mean beta - .02). In addlion, topic famii^rity was sign^:af«y positive in 2 
analyses (mean beta • -.03) and argumem overlap w^ signtticam toi 3 out of 1S analyses (mean 
beta » -.03). 

In a set of foBow-t^ muit^ re^pession analyses, we added ir^rsKaton tenns for arc search, 
simctural distant^, and causal strength (AxC, AxD. CxD. and AxCxD). Ttw AxD imeraction was 
signifteant inmost^ the analyses (9 out of 15) as was the in the analyses of answer 
prodution scores. Once again, the causal strenj^h iMBcfidor rarely imer«:ted w«h tl^ other two 
convergence components; the AxC, CxD, and AxCxD Interactions were signifteant in only 2 out of 
45 cases. 

To summarize, these mufiiple regression analyses provided conslstera and robust support for the 
arc search procedure and for structural distance, but failed to show eflwis of causal strength and 
argumem overiap (Le., two (ttnwisions of oonstraM satisfaction). We have a piat^ible 
explanation for the fining that causal ^ngth had no bnpact on answer production scores and 
GOAjudgments. Individuals may need a suffictent^ deep level of understani^ ^XHit the topic 
before they can construdcwmai^rpretationst^ the events. That is, an anaiy^ of temporality, 
operativity. necessity, and sufficiency requires a rkA\ body of worW knowledge. The coHege 
studems probably had a very superficial understanding of the 24 texts so effects of causal 
strength failed to emerge. 

Decision btennkHi tor CiHA fii^iffpfn^f We analyzed the dedsion latencies Of the GOA 
judgmertfs in the s6»v9 experftnem. The mean dedsion latencies varied from 2.16 seconcte for 
CONS queetiiHis to 3.16 seoMids for when(be«ore) questions. TaWe 3 shows the outcome of 
the multiple regression analyses on ttie decteion ^enc^ The regression equflion sigrtificantiy 
predicted latencies in a015 equations and accoumed for 12% of tlw item valance. 

Table 3 shows beta-weig^ for the arc search, stnjctural trance, and cmis^ strength predictors. 
Arc search was sls^iiflcantly positive in 8 out of 10 ar^iyses in which legal answer 
antecedents (mean beta - .23); in these cases, decfaiion times were tonger for legaf arswers than 
for iil^al armwers. Arc search was not significam in any erf tfie arises in wftidi corsequerx^ 
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^■^2^^^^^' whentbefow), »id 3 »rtiy(ln oidw). StnictumI dtetance was 

s^nrftewi^ negative m 7out 15 analyses: the sign was negative m 13 analyses. Thus, the 
^ision latencies decnsase as a function of the number of arcs twtween the queried event and 
the answer event. Causal strength was sigt^ficant m only 1 out of 15 analyses. Topfc famiiferitv 
and ar^ment over^ did not have a consistem s^jnif Icam impad on decision latencies. In follow, 
up mutt^ re^Kassion analyses with imeraction lemw (AxC. AxD. CxD, and AicCxD). only 4 out of 
the 60 interaction tenrowwe statistically signiflcam. /. «7 

The th« GOA latencies decreased as a function of structural distance is ^loomp^ tile with a 
spre^ng activation explanation of distance ette«s. A sj^eadinQ activation explanation would 

fsimionsh^) between distance and latencies. The stnictural d^anca evaluation 
reflects (fiscnmlnation jvocesses r«her than search processes (Wagoner & Wender, ^m) That 
IS, It te more mioutt to disoinmate the relativa temporal order of two events when the two events 
are strui^rai^ close. 

Summanf cwnnients on mtposwofy These studes showed cor^tetenS support for 

the arc search procedures and stn«aural dfetance. but veiy tttle support for constraim satisfaction 
(causal strength and ar^iment overlap). The fact that the constraint sat^fiBition came up empty 
suggests that the compretwnders must have a deep level of domain luKnwiedge before they can 
assess causal relationsh^ between events: the college students hi this study probatiMy did not 
r^ve an impressive amount of badcground knowiec^e for the scientific mechanisms depicted in 
the text. 

The arc search pioceftjres «Td stojctural distance wwiW go a tong way ^ converging on a small 
number of good answers to a question. Given that the texts m this study tisa 5 events, there were 
4 explicit candidate nodes thstt were potential answers to e^ question. On the ave^. haH of 
these answers would be piuned out by the arc search procedure, which would pursue either 
causal ar^ecedents w causal consec^jences but not both. Stojctural chance would further 
decrease the space becmise answer quality decreases exponentiaBy m a function the number 
of arcs between the candidate node and the (^eried node. According to our bm-m estimmes of 
the dampenning cuive, 1 .3 out of the 4 lodes would be good answens. a convergence ratto of 
.33. As the database grows in vohjme and the graph structures have more diverse paths the 
convergence ratio gets closer to 0 {Graesser & Frartidin. in pre«). 

With one exception. QUESTS arc sean;:h procec^res are »tequatety cajaured in Figure 6. The 
exception lies in the why cpie^ns, which interad wtth type of knowledge domain. Causal 
antecedents are prevalent when why-questions are answered in the context of physfcal systems 
whereas causal consequences are more prevalent when technological and blologfcal systems are 
probed. This difference was e;qNaiiwd by postuiming that a teteotogical goal hieraichy was 
supenmposed on the networks m bioiogkal and technotogical systems, but not In physfcal 
systems. ' 



Questk?n Answerim;. in tho rontaxt of Narrattsn* TA»t 

In a series of ^ies we oolected questk>n ai^wering protocols and GOA iudgmenis after college 
students comprehended sbn^ narrative texts. Unlike the expository texts hi the above studies, 
college students ^nerate a large volume of knovMge-based irtferenc^s when they 
con^fwnd sim^ stories and scripts. Accordbig to some estimates, ttie vowme of knowledge- 
based inferences is 4-5 times gmaier in nan«ive text than expt^itory text ((Cesser, 1961 : 
Graesser & Clarit, isas). The acttons and events depicted in nair^ have a ck}8e 
correspondence to mundane everyday experiences so inference mechanfems are more 
automatfc. Narrative is an exceOent genre to stucfy because ttwe is an extensh^e inte»play 
between episodk; knowledge and generic knowledge. 
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There has be«n a deal of aqapoit lor QUESTs csom er^nce mechanisra in studies \m 
have cosecmd <^ie^ «isw^ pm^ols after sitjfects cofnprehend narr^ives (Graeeser . 
i9/*8, 1981 : ^aesser & ClarK 19^; Graesser & Murachver. 1985; Qraesser el al.. 1980: 
Graesser et aJ.. 1981). Answer proc^Jdton scores tor how, vi^n. wtwe. enabie, and CONS 
questions are 8i(^^fic4f% iMeiScted t^f QUESTS arc search prpceduree. by stnictural <fistance. 
and by m»iy tensions of oonmraMsiaMaction. Tti^ findinj^ were repoiled ^ studies 
published before iWs contract, so they wH not be covered in this report. The lesearch conducted 
in this coniracs focused on GOA jiKigments for quealion-answer pairs after subieds 
comprehenited short nanative passages. This research has been written up in tvK> stut^s 
{Gok«ng. Gmesser. & Mttis. in press: Graessw, Lang, 8> Roberts, 1989). 

College students at Mem^is Stme Urtversity first re^ one of two stories and then jud^ the 
quality of particular aiswere to particular questions abort a:tiofw and even^ Oneol 
the dories is provided beiow, fofiowed by tviio exanif^e <^stton-answer pairs. 

Thfl C2ar and hte Pff fl7h*ftiB 

Once there was a Czar Who had three loveiy daughters. Onedaythetfvee 
daughters went waBOng in the wckxIs. They were enjoyi!^ themsehres so much thai they 
forgot the time and stayed too (ong. A dragon itkf nappiiad the miee dajgiters. As they 
were being dragj^ off. they cried for fwip. Three heroes fwardttw cries and set off to 
rescue the dnjghters. The heroes came, fought tfw dragon and rescued the maiders. 
Then the heroes returned the daughters to their palace. When the Czar heard of the 
rescue, he rewarded the heroes. 

Why did the heroes fight the dragon? 

The dragon kidnapped the daughters. 

Why the heroes fight the <^agon? 
The daughters were frightened. 

The first answer to the qu^ti(»i refers to an actton that is explicttly stated In the text whereas the 
second answer m a luiowiedge-based irtf erence. Our gwl was to test QUESTs amy to ej^in 
GOA ^ments and dec^n latencies both for answers that were ir^erenc»s and for ans»¥ers 
that were expiictt text st^ements. 

The procecfcire forcoBec^ GOA jud^nents and i^encies was exactly tfw same sa that 
descrft>ed for the ^udy on exposiiDry text. On each trf^ the sut^ec^ remi the (^e&Hon sA their 
own pace by preying a bi^n. After a bri^ .5 second d^. tl^ screen w^ erased itfid replaced 
wrththeanswer. TTiesi^biedftidicatedtheirGOAjud{^nem by pressing 
answer versus GOOD answer). In some stud^vveooftscted discrete GOOD/BAOjudg^nts 
whereas in others we effected GOA Judgments on the foOowing 4>poM scale: (1) bad answer, (2) 
pos5a>ly an aooept8)ie answer. (3) moderately good arttwer. and (4) very good answer. Latencies 
were not am^ed when judgmerts were cotecied on tfie4i}oint scale. FWy sut^s pnMied 
binary GOA ^jdgments whereas 60 provided GOA rMbigs. An equal nun^r of subjects were 
assigned to eat^ of tt^ five que^ condition. 

Maledaifi. The narrative texts were two short stones that have been investigated exienslvetybi 
previous stu^ (The Car ^oiy and a stoiy about ffiiaf« and a dove), bitsntional actions and 
4 events were selected as queried nodes fRND each passage, yi^ding 18 (^leried nodes 
altogether. For example, the queried actions selected from the Czar stoiy were: The daughters 
waiiied in the woods, the dragon kidnapped the daughters, the dragon c^s^ged off the 
daughters, the fieroes fought me dragon, and the heroes returned the daughters to the palace. 
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Sixteen ansvver noctos were associated with each of the 1 8 queried noc^ yieidtoig a total of 288 

Why. how. when, enable, and CC^iS. The woreMng ei the questions and cmeaories varied 
somewhat among the question category. For ^Tan^ ?\rm^^j^Z^^^ 
IP u om thfl t^floon then the five questions would be: Why/howAwhen did the her«iSfiaSt5 
«iml!S^'S^?^ ^ and What are the consequences of the 

t1Sl*^^t"^y? ^ "^^^ Werences (82%) as opposed to ex|« statements in 
the pass^ The Werences were sampled from ques^ ai«werino protocols collected by 

'"»^««*y.<^'Aprotocolswefeco^foreacheK^ 
siatement fn the two sioftos. Each passage statement was rmmd w8h \hs five oiedion 
cat^fwsjwhy hOKv. whm\. enable, CONS); 10 sut^s generated answers for each of the five 
^eaion c«egoriw. Assod^ wfth e^ particuteir qi^stion (e.g.. Why did me heroes ftaiit the 
dragon?) was an answer distrSxition wWch Eluded afl statemem nodes that were preduc^ds 

answers 1^2 Of more out of the 10 siAilects, Associated wBh e»;h answer was an answer 
production score. 

Our jTOthod cH sampling nodes from the answer distrRxjtion was stra*1lted and random, with a 

numberj^ criteria tr«t needed to be met before a node was accepted In the answer sa^ 
Regardftig salification, answms were selected from the answer d^stilbutions of aB five question 

^.f!^^^^!?^^?'^^^' ^ randon^ from the answer distitxjtions 

of st^ement N-Z, N-l . N. N+i . aid N^^, siKih th« N refers to tl^ queried node. The answer 
distrtoutions ^sodated wm passage st^emert N were weighted higtwr than me <«her positions. 
A noctewas ehminated frem the sanple tf 11 was a style spec8fc:aion (^^^^^ 
time index (e.g.. in the afternoon, yesterday). We focused w viewers that were events, ^ates. 
actions, and goals because nodes fti these categories can be aitiaiiat^ as complete sentences. 

When tlw answere were prepared, the even» and st^ were articul^ In exa(% the same way 
across tiK five question carries. Events and i^es were declarative sentence in the past 
tense, witfi noa>nnective6 preceding the statement (e.g.. The daughters were frightened, It was 
a nice day). However, there were ffcJduatlons among questN>n categoi^ as to whether an 
answer was aiUcutated as an aSion {e.g., Ths dragon kidnapped the dsjgmers) or a goal (The 
dragon wanted to kkkt^ itm daughters). The mtos for articulating tfw answere un^onnly gave an 
f St^^ »te best shot « being jud^ m a good answer to a question (Graesser. Lang. & ffebeits. 

1 969), 

Vanabtes and muttteto mqntgsiQn aniiiysffli . There were three (indent measures: GOA r^ing 
{on the 4.point scale). GOA jud^nertt (the bkary GOOMAD decision), and GOA jud^nent 
latency. Three separ^ sets muKipto regression ar^lyses mre peifomwd, correspoiKfing to 
these three depended measures. Whenever a mult^le regression analysis was peifemwd, 
variabilliy among question-answer ttems (averaging over si^^ds) served as an enor temi; the 
beta-weigf«s repoiiod in the sut»eauent tat^ are bai»d on these Hem analyses. However, in 
an tests of statistic^ ^gnWcance. Wb assessed variabftty among sUH^s &i addition to variabitty 
amongitems. Whenvari^)i«y among sut^ctsw» assessed, we perfomied multiple recession 
analyses on dividual st^^cts and te^ed whether the b^weigh^ of each imfictor 
signif icar^iy differed from 0. 

The predictor variables fei each rej^ession analysis are listed and spectfied below. Somepre<ft«>r 
variables were theoretically interesting from the perspecth^e of the QUEST model, namely those 
associated with the convergence mechanisms and the infomtation sources. 
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between the entry nods and the answsr node in the daiabasa. The scorn was 0 « mere was no 
fegaipath. it should t>e noted that the texUiase oome^ on^ e^q^ilcM passive n^ Whenthe 
answer was an biferenoe, &m placed at m Virtual location- in the texttoase. as if the inrerence 
were an expiidi ^^m^. The me search proc^t^ires cfiffered among the five <^ie«lon 
categories (as (ta^sed fci Graesser & ©ark, 1985: Gnesser & Kluracawer, 1985). so the values 
on this variatsle d^nded on the question category under cor^eration. The mean arc search 
scores were .55, .46. .68. .68. and .40 tor the why. how. when, enable, and COI^ questions, 
respectively. 

{2) StructUfald i aanCfl . This was tf% nun^r of arcs between the entry node and the answer 
ncMje in the texmase. Wherover two nodes were on muttipie paths, sm^ural distance was based 
on the shortest p^. The mean score was 1 .7 a^. 

(3) COM l fa l nt satistiffitlffn . This vanat^me^red the extern to which the answer satisfied a set of 
semantic and conceptual oonstra^ of the (^eried node. The values v^ied from 0 to 1 on each 
dimension, specifying whether tr» corstraint w» not satisfied versus was satisftod, respective^. 
There were five j«mensiorw: argument overlap, causal sfrength. temporal compati>i%, planning 
conpatlbiay. and f^ausftiltty. These dimenstons were defined earlier in the section thm 
described the QUEST model. Two independent raters provided HJdgmenls on each (^mansion 
and achieved a satisfadory degree of rsliabiHty (i.e., between .75 and .96). An over .- 1 constraint 
satisfaction score w^ computed, consteting of the average of ttwse five dimei^ns; the mean of 
this score was .65. 

Number <a oenerfc tntormatton snmir^ 7Ti^ varH^ w^ the number of generic ir^onnation 
sources th^ would si^ the answer to the question. cor«ent mmS in the queried node 
sen/sd as a poter^ bif»n)Mlon stHirce. Dec^ions needed to be made as to whether a particuiar 
answer was stored in a given lr*>miation soun^e {e.g.. whether me node Xisfr^nft d is ^red 
in the GKS for FIGHTING). These dedsions ¥^re based on sample of data collected by Graesser 
and Ciaiti (1985). Graesser and Clailt extr^ed the content eom GKC associated wUh the iwo 
stories. The content was extracted empirteatiy by a "free generation pbs quesSon answering 
method;' subjects in one group gerwrated Usls of tyf^ pn^ies, actions events, and other 
nodes In a particular GKS whereas sut^eds In arKJther gpoup answered c^e^ions about the 
content extracted from the free g«ieration san^. Asso(^ed wlh each GKS was a ^ 
statement nodes that were gerwrated by 2 or more «ib^8. Regarding the |[^esent study, an 
answer was scored as coming from an Wonnation source » K wis a member of the Graesser and 
Clark node Iwt for that frrtormadon source. Anavera^ansvverwasamemberof approxmiately 1 
generic infonnation source (mean ecpial .92). 

(5) Vert?atim Staftmftnt. This vanable spedfied whether the answer was expiicaty mentioned in 
the passage (vafcie - 1) or whettw was an inference (vafcie • 0). The mean was .18. 

(6) Ansffflf pigtiuctlon mm, This was the likeittiood th^ particular answer would be 
produced when a particular que^n is asKed in a (^estion ai^wring ta^. The answer 
production scores were extracted f -m the emj^ricai ar»wer distrftnitiore collected by Graesser 
and Murachver (1985). The mean answer production score was .06. 

(7) Queried action/ftvimt The queried node was either an intentior^l action (value • 1) or an 
event (va^ • 0). 

(8) SlOty- The Czar ^ory received a value of 1 whereas the Oove s^ry received a value of 0. 

Correlation matertces were prepared in order to assess whether there was any serious probtom of 
collinearity among predictor variables. A correlation matrix was prepared for oach of the five 
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S]^S^!Sl2^!^Ii;^K*'^ ^"^^^ ""^^ ^ constraint 

, ^ breakdown of the five dimenstons) and Uwe addiSonal vaitables 

(rwmber of conMni woids ^ answer, numbar of syttat^ fri answer, and the togartmrn of word 
frequoTcy^ cof^ri words}. Given tfim ^ matrix had 1 1 vafiat^s. tfiers mre 55 cjorrelations 
in eacfi matrix and 9S unique oonefetttons among ttw five matrices. Only 1 of the 95 oorreiations 
was grsatef than .40. mere was a posfihre correlation between nun*er of oortent words and 
number of syllables, i - AiKJther 8 con^latlom had absolute vaHies d 3no ^Therefore 
91% of the correiatlons were smaN or moctest. 

QQOdngSSjjf'answerfCinA) hirtqiTyntS . The mean GOA judgments were 40. 43 39 51 and 

i^Si^'Sr^'J^r'^tT' I!2^4 ®? <^esttons. respsdivefy. The mean GOA were 
1.99. 2.07. 2J1. 2.32, and 2.37. Tatte 4 presents the oitcomes of the muRipte regressi^ 
^"S^^S? f^*«»«« QOA Msmertfs and ti» 6QA ratfngs. Beta-weights are ^sented for 8 
predictor variaWes. sei^ti^ by question category, mm aver^ng over question categories 
and considermg ttein variance, the muKlpie regression equatioi^ accounted for 50% of the 
vananceofbinaryGOA|udj^Twm8and52%(^thevartoKeofGOAratln«. Afiof^mufttoie 
regression eqi^ions si^tficantly {^edicaed the GOA d^^isions. Tabte 4 indlcmes whethw^ 

predirtor variable was slgr^cart hi me Item analyses and In Uie subled ama^ 
predicior as si|^ilftear< tf a was stmistkaily sigi^icani ^ both th^ 

The regression equations were almoa ideniteal for the binary GOA decisiorK and the GOA 
rating. Thlscanbelflustraiedbycon^HitlngthBpropj^tionofbeta-weigf^thaihadlhesamB 
qualitative witcome, i.e.. both were significantly positive, both nef^ive. versus both 
nonsignificam. Of the 40 beta-weight comparisor^ in Tabte 4. 35 had the same c^ative 
ouU»rm(Se%). The values of the beta-weights were also quanitetivelysbnHar. When the t^nary 
gom ^{^ments w^e compared to the ratings, the mean beta-weights (averaging over mjestton 
c^e^) were virtua»y identical: arc search (.45 versus .43. respedively). stnjduml d»»ice (- 
.09, -.09). oonstraint sat^action (.23. .23). Infonnmion sourt^ {M, .01). verbatim ar^wer (.04 
.06). answer prxx&ctton score (.26, .29), queried action/event ( Oi, -.01). and story (-.02, .05). 

Support was found lor aU tlvoe components of the anvergence mechar^. Arc seareh arxi 
constraint satisfaction had signlficai% positive beta's m all 10 multipte regression analyses. 
nH!^'^ ^ sigitificaf^ly negative beta'^ in 7 out of 10 equations. The when and 

CONS question did not show consistent support for stnJdural diaance. In addftion. tlw bivariate 
correlations were perlectly compatflDle wtth the beta-weights In Table 4. 

We pert onned some foUownip multipie regression analyses that assessed Interactions among the 
three components of convergence: arc search (A), stmctural distance (D), and constraint 
satisfaction (C). That is, we added tour interaction terms to the muttiple regression ec^ation (AxC 
AxD. CxD. and AxCxO). The three bfleraaiontefflrwsi^tificantrymcroj»ed the amourfl of 
explained variance^ the ttem analyses from 51% to 52%. Twon^ralgn»^of J^stlons 
amerged on the basis « the patterns of 3-way WerMtions; wfy. how. and eni^ questions 
formed one g^oup wfmas the when and CONS questions formed the other. Signlfteani 3-way 
interactions were found fw the first group but not lim second group, figure 9 plote the 3-way 
interactions lor the two j^oups of subjects, segregming tfw t^nary ^ments and the ratings. 
The b-wei^ (i.e., nonstandarcSzed regression coefflciems) of the three main effects and four 
interaaion tern* were mmi to generated the va^ in Figure 9. As shown in Fi^re 9. «mdural 
distance con^8tcr% yielded «« Ihws for when and COr^S questions. For why. how, and enat^ 
questions, however, structural dtetance significantly decreased GOA scores for the legal answers 
that failed to sat^ oon^raints and for the illegal answrs that satisfted constraints: the other two 
tines were essentially fiat. 
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The above results f to sti^poft a sequentiai model which assumes that the arc search 
procedufe is executed b^fe constraint satisfaction, tf anything, U^se two oon^xments are 
peifomied sbnultaneously and appear to be ad^tive. Both ^gal and iflegai answers show lobust 
effects of constraint s^ialaction. Tlhe AxC interaction term wis nonsigrtfieantfoi 7 out of 10 
analyses, w outcome that would tend U)su;^ft an addMive model. Hie GOAKjdgmems were 
consi5ter«V '^Qlw ^ iHegaJ answers that smisfy constraMs than tor iOegai answers tha^ 
sattefyopnstrairts. The dmaoons»i»% argue iH^ibtst a sequentiai moctol in which the arc sean:h 
procedure Is executed prior to the constraint smisf action component. 

We peffomied a set of nxAipie regression »iaiyses wNch segregated ttie five dimensions of 
constraint si^action. When averaging aoo^ tfw 5 question categories and two GOA scales, 
the mean b^-weig)ns were -.02, .10. .03. .14, and .19 for an^ment overtap, temporal 
compatlb%. plarffi&ig compirtibmy, piausibiilty, sma causal strength, re^sectiveiy. Causal 
strer^ was si^iificam in ati 1 0 analyses: temporal 0Dnpatft}iRy and piausiMlity ¥vere each 
s^ificamin7anaiy6es;piann^oon^}ata>ffitywasslgnfflcantL''4ai^fy^ overlap 
was5igr«icantinonty2araly8es(wtthoi»poi^andonene(^lvebeta). Whenconsictering 
these 30 si^ificant effects. 29 were fti the direction ihm vkhjM be predicted by QUEST. 
Therefore, there was evMence for 4 out of the 5 dimensior^ of constraint salk^action. 

Muitipie recTBssion analyses were |;Mitomied on those answers that had answer production 
scoresofO. Th^ answers to a particular question were never generated by sut^s who 
supplied Q/Aprdooois^ the Grader and Mura(^im(l^ study. The same re^vssion 
analyses were petfomed as those bi Tabte 4 exceptthatanswer production scores were dropped 
:.^ecause the value was always 0). AniOequiOionssigiiAicamiy predicted the GOA)udgmerts. 
accour^ing for ^% of the item variance overall. The beta-weights were quamatlvely and 
quantitittiveiy the same as those bi T^ 4. Of the 70 beta-wei(^ toitf^ anai^, 65 had the 
sanwqualtti^mjtcome as those bi Table 4. The mean beta-weights of the theoreHcaHy 
interesting predictors were Quantttaflvely sin^for (a) the eitbe answer sample and (b) those 
answers with an answer produoion score 0: Arc search (.44 and .44. respectively), constraint 
sat>sl2K:tion (.23, .28), stnictural distance (-.09, -.09), generic Infonnation sources (.03, .02). and 
verbatim answer (.05, .04). 

The muitipie regr^sion analyses consister^ failed to suppon the predii^ion that GOA 
judgments would increase as a function ^ number of genieiic ir^rmatlon sounds. Only i out of 
me 10 analyses \n Table 4 showed a signfflcar^ effect forthte predictor. As discussed earlier, 
there was some foundation for anticipaUng a c: "viiinear reiatior^hip between number of generic 
Information sounds and GOA. Specfficalbr. ar^wers that come from many infonnation sources are 
uninlormatlve whereas answ^ from no infomratton sour»s are somefimws dHf icul to Werpret. 
In order to test for a prasible curvilinear relattonsh^, we perfonned 10 regression analyses with 
two predictorB: Number of Inlomi^ion source (I) and rz Tf» l*'2 tenn was nrt significant in any 
of the analyses. Whmw^rMtrlcted our analyses to answers that p^sed the arc search 
procedure, again there were no signHicar^ r*2 tenrn FUmNy, we assessed whether i interacted 
with any of the ottier prec^ctor variables and once e^n came up empty. 

Decision latencias tor GQA turiffmenty. Mean decision iatendes for the binary GOA jud^nts 
were 2.39, 2,40, 2.79, 2.51 , and 2.69 seconds tor why, how. when, enable, and CONS 
questtons, respectively. When we peilomied muKlpie regression analyses on these latencies, we 
included number of content words, number of syllables, and word frequency m predictors in 
additton to the other 8 precSctors in Tatoie 4. 

Table 5 presents the beta-weigl^ from itm mufi^ regression analyses, segregated by the 5 
question categories. Separate analyses are presented on the comi^te set of Kerns and those 
question•ar«we^items with answer {voduction scores of 0. Each of the 10 re^^ession analyses 
was statistically significant, accounting tor a mean item variance of 35%. The beta-wei^ were 
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fTJi^X^^ ^"^^^ ^^"^ set and tfw set with answer production scores of 

w Arc sean:h (.04 versus .OB), stn^turai distance (-.04. -.03). con^as^ smS^iozm 

{.17, .13). conter« wonis (.22. .22). syllables (.28. .28). and word frequency (.00. .00). 

^^a^!^ t»ta-wij^ in 5. iegal awers to wfiy. how, and enable questions 
f^^a^'SSSrJSSS?'^'^^^ In^mrast/theaic^^bS^ 
^i^^^^^^J^'^^l^^' nwisigjUffcart or negative. It should be noted thSme 
«nS,SS^!^ occuired for the expostfoiy texts reported earter. Tlw beta-wetohte for 
^ral distoi^ were negative in 8 <Hit of 10 analyses, but were never signs JrtB^act that 
they were negative is er«rely cons^ei^ with the earlier studies of expos^^tS The be^ 
weights lor constrato^ sjdi^action were s^nificant in only i out ofio ai^^ 

'H^J^^^^,^^^'^ regression analyses li^ assessed interactior^ among the three 

TliereZ.^pSed 

Ss ^a^S^i^^^SlLf did not interect signWcantfy with the^hSrw« 

Slf 2^^7Sf2S?^J^L^ ^ ^ °" ^ ^erection tenn 

f^Jl Wfctor variables in TaWe 5. This AxC bnerection was eifter significant or 

aunog »«nrf ^ f^ wr^ how, and er^ que^ions. but not for when and CONS cXl^ n 
shouW be noted mm the GOA judgn^nts also manifested natural groupings for why 
enable questior^ versus when and CONS queens. »'»~>«*»8s ror wny, now, ana 

Sl/'!^™?!'^ ir^ciion between arc search and constraim smisfac^ion. se^^lng why. 
^i^T^,^!^'^^'^''^^^'^'^^^' EachWerac^XgSneS^ed 

^^rS^^^^^:J^^^ oonstra W sat^siUon had essentfa^ no inpact on decision 
teter^. For the other ttiree t^estion categories, however, the Blegalw^ ttSfaltedto 
Si^lSSriir^ than than the other ifwee item categoric (i.e.. legal answen^ 
SnStf ^ oonstraWs, and Blegal answers satisfy^ 

SLf^oSi!^ "^^T^ ^ '^^ Another way of view^ these cSta 

hat tl^answeiB whk^hred a c&crepency between arc search and cons&aW s^action 
(i.e.. lega^ answers fai»ig oor^tralnts and Ulegal mmters sattefying constrairte) had ionaer 
^SJI^rllf ®^ol"^" *I5 ^'^^^ °f succeeded on both components: this 

Summary of ftfiriifts an ifflrrathm mift . three compmits of quests convergence 

^^L^il^SSIIr^il*^' ^ *idgmer«s: Arc search, stnictural distance, 

and constraW satisfaction. The pattens of GOA jiKlgmer^ and decision latencies provided 
some c^es about the bi^»aJonanxmg«Tettwe convey The piocessino of 

Why. how. and enable questions were somewhat ««fereni from that when and CONS oiestions 
(as was the^ for the studies on exposQory text). Consider first the why. how. and enabto 
^"^^ '^"'.Jl* ^ ^ con^rakit sati^a(^n components are apparentiy executed In 
parallel When output from the two components were m agreement (I.e., both GOOD or both 
BAD), then the ^jpiopriaie decision w^ made. More time was neetted to detenu^ tfia! the 
answer Is good than the answer h«j; good answer require thm muli^ {»ftefia 
wnere^bad answers can be detected as soon as there Is a faikire on a few criteria. Whentfw 
output from the arc search component W posftJve, but the output from the corwtraW sattefactlon 
component is negative, then ad(»ional time was needed to evaluate the answer on stnictural 
distance. Sut^ apparently used stnjctural distance as a witerion for breaWno a tie between 
the arc search and the conslreint satisfaction components. 
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The ^njctuna ctetanco evalu^r consists of a process of iudg^ distance rather than a piDcess of 
searching through a stmcmrs. if anythkig, the (tedsion ttf enc^ were fieter for znstm that 
were ^ Crater di^yices from the queried node. Any mechanism th^ enphasizes search 
processes, such as spreading »:^atic»i or marker pass^, would have priadicted longer latencies 
for structurally di^nt answers. 

Structural distance (»d not piay a significant role wtm individuals judged answers to when and 
CONS (^estions. The arc search and jonstraint satisfaction components were processed in 
paraUet, with no speed advantage for good answers over bmi answers. In should be noted that 
Graesser and ft^rachver (1 985) also reported that stmc^nal d^ance plays a minimal role for 
CONS and when queMlons in Q/A taslcs. 

There was no evidence for a sequential order in the processing of arc search and constraint 
satisfaction. Th^ two oomponente combined in an adc^ive fashion. However, tlwre was 
evidence for sequer«tol processing wl^n ^nicturai di^wice was analyzed. Stnicturai (^stance 
was evaluated ^ter the arc sean^ f nd cor^raint sat^ac^ oorrponendi were completed (in the 
case of why, how, and enable questions). That is. arc search and constraint sattefaction preceded 
stnictural (^stance ev^iation. 

The number of generic brformation sources had a negligibte bnpiK^ on the GOA judgmer^ and 
iatencies. One rei^on lor th^ rojn effect may be thm the textbase involved simpie stores with 
very f amiiiar oor^ent words. The oor^ent words triggered Q<Ss that were weH learned tvid highly 
automatized. Pertiaps exposHfoty texts on difflcuit unfamiliar topics would show greater effects of 
multiple inform^ion sources. 



Question Anawarinq In the ConteKt oi Generic Knowledge Structufay 

We collected GOA judgments and decision latencies for question-an^fver pairs in the context of 
generic itnowiedc^ ^nirtures. Coflege studerAs wme tested on approximately 500 trials that had 
the following phases: 

Genflric cencftpt- Consider the concept Of K^E 
Read questten : How d(^ a person dean the house? 
{Sub|ed prases bi4ton toiiowed by .5 second pause]. 
Judge answer : The person ge& a broom. 

(SidJject presses BAD or GOOD answer butU>n] 

The subject received different generic concepts from trial to trial. GOA jiKigments were collected 
for why, how. enabte, and CONS que^ior^ in the context of 8 dWerent generic knowtedge 
strictures: time, tree, home, gM, hero, crying, waSdng. and fighthg. These 8 GKSs cover a 
broad landscape of concepts: abstrad concepts, plants, locations, humans, events, and 
intentional actions. These concepts are t>n>^ (fi^ributed amor^ KeiTs ontoiogic^i c«4egories 
{Keil. 1979). 

The 8 concepts were selected from the 51 GKSs that were bivestigated by Graesser and Clark 
(1 985). As discussed eariier , (^aesaer arxf Clartt used a free genen^n plus (pje^ion ar»wering 
method to extract the content of each GKS; one ^wp of si^ects m^^M free generation 
protocols where^ a second gioi^ answered a why and a how (^jestion £tf}Dut each statement 
node in the free generation set. The total set d nodes included aS sti^emenis generated by 2 or 
more sut^ects. A conceptual gn^ structure was subsequently prepared for each GKS, u^ 
QUESTS mtes <A composHton and me ref^esentational system specSied by Graesser and Clarit. 
An average GKS contained 166 statemei^ noctos. 
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Four intemfonalartions and 4 events were setected as queried TTie 
It^'^P^ ansnreiB was sti(^ly (Cerent for queried actions and everts. For each queded 
actior^ there hwb 16 mwer nodes; these nodes nmrn selected using a a stratffied random 

«^ were 8 Q<S8. 4 queried actions per mmrS^^ZZ^ 
queried no^lhere were 512 unique pairs of queried node and anover node. These same 512 
pairs were twted on each quei^ c^^r (why. how. enatto. and CONS). TT»e lehSon cS 

^^J^^ ^ ««»P* ^ ^ 8 »«we« w»ro sampled per queried 

node (yielding 256 unique pairs of queried node and answer node). 

Each sui^eci pn>yided responses for 512 Hems that spanned all four question categories. Eight 
^ ^'H^^^ ^ question-answer pair. Separate groups sul^eds were assign^ to 

randomly dete.Tnhwd tor each sut^ separately. Both the GOA judgment and the decision 
latency were rffcoided by the oonpjier on each trial. 

Vaflabtes in mufliPte mmwiiStffn ana t ysfffi . The criterion variables were GOA^xIgn^ and 
decision latency. The proSctofs of primary theoretical Irterest were arij search, smictural 
distance, constraint satisfaction (the overaM meaaire, as weH as the five dimensions of argument 
overlap, tempore comp«8>Htty. planning compatibHity. piausit^. and causaf strength). We ^ 
scaled each answer on number of Wonnalion sources having trained hJdoes assess whether a 
given answer would be stored under each content vmm &i the ^er^ noS^t^c^ttte 
^ider the 9^n "How does a person clean the houser and the anmr The person gets a 
^""'J^J^ assessed whetfmr -X get broom- ^ stored under PERSON, under CLEAN, 
and under HOUSE (using a scale). Answer production scores were confuted for each 
StSe U^5^^' ^ ^ collecting QfA jrotocote from a sample of coHsge sftjdents at Memphis 

In addition to the above theoietkiaUy intereming predictors, there were several premors \m 
were of less Merest. These bx^ided the 8 cfcjmmy coded variables conesponcfing to the 8 
concepts, and dummy coded variat^ conesponding to different groups erf subnets who 
received particular 9em sets. Wo also incfc«led the nun^er of content words m the ar»wer. the 
mean imagery ratino per contert word, the mean woid frequency per contert word (the 
logarithm), and number of syUabtes In the answer whenever decteion latencies were analyzed- 
these variables are known to substantiaHy Irtluence reading times (Habertandl & Graesssr. 1^). 

gQAiudomtms - Table e presents ttiemjtcome of the nrRjaif^rep-es^n analyses on GOA 
judgments. GOA ^dfipients were slj^iiflcantly predicted In e%h of the 8 analyses, account'ng for 
an average of 35% of the Bern variance. The arc search component was signtficant in aH 8 
analyses (mean beta • .31) whereas most of the analyses showed sign«icant effects of stmctural 
distance (beta « -.12) and oonsMit satiitfaction (beta - .16). Answere haxi h^ GOA 

judgmery if ttiey mrtched a nocte in the verb QKS (mean beta . .13) and in a GKS associated with 
a noun (beta - .07). Therefore, answer quality increased as a function of the number of 
informatKm sources siw*yhio the answer. Once again. GOA iudgiT»r«s were preceded by 
answer production scores (beta ^ .4). 

Analyses were perfonned on the interaction tenre associated with the convergence components 
in exactly the same way thiEtt Interactions were analyzed in the study on narmive text Thefour 
interactton temw (AxC. AxD. CxD, and AxCxD) incre^ the amount of ex^ained Item variance 
from 35% to 37%. The panems of Weracttons were quto conpatftjto w»h our analyses of 
nanative text (see Figure 9). fstone tfw 8 if^eraaion temw were signifteani in the case <rf CC»JS 
questions. Regarding why, how. and enable questtons, 9 of the 24 possible imeradion tenro 
were statistically stgntficant (3 slj^lcant effects for the AxO, AxC, and AxCxD mtei^aions). TIw 
patterns of the interK:tion8 closely repOcaie the patterns in Figure 9, so we %irill not plot them in this 
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report. Stnjctural {^stance ^ not inftuenoe GOA judgments when arc seaix^ and constr^ 
satisfaction were m aQreement (l e.. t)o«i were GOOD or Ix^ were BAD). When there was a 
djsagreemer«, however, then GOA (teoreab^ as a functkm of stnjdural d^ance; there was a 
panicuiarfy ^e^ siope for te^ answem mat faHed coneibaSst saHsfaction. 

We perfor.r»ed some toflow-up muftipte regression sections that segregated the five (fimensions 
of constraim sidisfaction. The mean t)eta-weights for me five dimensions were: argument overlap 
(.07), temporal compatltiitty (.07), planning oomp^l^ (.12). p]ausit)illty (.13), arti cffi&»ai 
strength (.00). The percentage cH sfgemm beta's was 42%. 

GQA dfiCSion jalflncfM- The muA^regres^nanalysas on the decision iatendes were not very 
promi^. The are seanmcxHnponertt was significu^ in only 2 out of 8 analyses: ttw mean beta 
was -.04, opposite hsi(^ to me beta's hi the rmn«i»ve study. As wfthaM studies hi this contract, 
structural dstanoe had a negative b^ in a6 6 analyses (mean - -.OS) and was signtficant in 3 
analyses. The ovarii impact of constrab^sati^action on latendM was 0. Veit GKS, Noun GiCS. 
and even answer production scores w^ rarely sigr«icant (3 out of 24 analyses). Whmwe 
analyzed the InteractitHi terms among the convergence components, the effects were rarely 
signHicam (4 of 32 analyse and no agreemem bi me 81^ of the sl^itfiaam beta's). 

SummafYOffintfngit- The GOA^idgmems showed the same patterns data for the generic 
knowledge ^njdufN and the nanatlve pass^es. There was evidence tor afl three components 
of convergence: arc search. ^cSurai iMansx, and consirair< s^tefadion. These three 
components showed an Mere^lng Interaction in tt« case ^ wf^, how. and enable cNostions 
(see Figure 9) but not for CONS questions. StnJdajrai distance has an iirpactwtw arc search 
and cor^int smisfw^ion have conflicting output but not when me two component are in 
agreemeit. Regard^ ctedsion ^ncy, me times decrease as a function of stnK^ral distance, in 
support of a comparative ^xjgment mediant rather than spreading activation. Otheiwise, the 
latency data were not particularty intere^ing for GKSs. 



Question Answ^rinp In Natufatfstle CQnte»f« 

A series of stiKtos tested OUESTs an; search procedures in the context of naturalistic 
conversation and complex pragmatic environmems (Grosser, Roberts, & Hadcett-Renner, in 
press). In the previot» experfanents discussed bi mte rep^. the pragmatic corttext w^ restHded 
and pertiaps unnatural. The texts were short, unteresting, and poinUess. The true questioner 
was unltnown and the motivation of the cpjestions was unclear. The questioner (e.g.. 
experimenter, booklet, computer) did not genubiely se^ icnowiecHie from a knowledgeable 
information source. In order to assess whether QUEST to a general model, we tested QUEST in 
three different naturalistic contexts: 

(1 ) A telephone sunrey on hi^orical or cunent events (e.g., the Titanic sinking. Hinkley 
attemptbig to assassinate Rei^). 

(2) Abusfries8interactfonfciwhidiaci«lomBrask8aderkaquesik)n (e.g.. How doas a 
pafmnpBfacrBdftcafri?tnahank) 

(3) An ftitonriew between an expert and a host on a popular televtoion program or 
educational (e.g.. Nightline wim Ted KoppeO. 

If the arc search procedures of QUEST can ^count for a substantial proportion of the ai^wers in 
these contexts, then we would be bnpressed wim the K»pe and extomal yaH(% of QUEST. 

There are ample reasons for being skeptical about the external valktly of QUEST. Questions and 
answers are embedded fei conversation. The cor^enl and oonstrainte of a oom^'sation can 
potentially trarsform the "Oteral meanhg' of a question and thereby radicaiiy alter appropriate 
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replies. Suppose, tor example, that a customer viste a used car tot, points to a car, and asks the 
salesperson: Whv Is 1<MS Buick so Bypfm^iyft? some repUes are presented beiow. 

(1) The en{^ is m perfect condttion. 

(2) This 1983 Chevy is in good condition. 

(3) Why doni you look at t}^ 1983 Chevy? 

(4) What price range ^ we iooi(^ at here? 

Reply 1 wouid be accepted by QUEST; it specifies a causal antecedertt to the Buick be^ 
expensive. RepSea 2, 3, and 4 wouM be re^onak^ n^es to the questk^i In the conversation, 
but these repfles are not accomodated by QU£ST. Reply 2 does not address the questio-. The 
salesperson infenvd, by viiUie the aistomer^ questton. that the customer coukfril afford the 
Buick so the saie^MTson recommended a less expensive car. 3 Is syitfactk^aUy a questk>n 
but functionally a directive: neither of these speech act cat^}ories are accomodated by QUEST. 
Reply 4 is a qu«^n, both synt«:»caSy and functionally, and therefore is beyond the scope of 
QUEST. 



Answers to questions in ni^rali^ conversMtons are constnAwd by tt»e speech partk:^>ants' 
goals, plans, and comiTx>n ground. These pragmatic corT^K«ients were discussed earner when 
the QUEST model was aiticuiaied. To the extern tha the goa& and plaiw become more complex, 
questions are toss IStely to be sfinple infonnatfen s^li^ utterances, for exanple, \he que^lons 
may be directives, incfirect requests, conversation monitors, or rh^ricai devk:es. To the extent 
that the common ground betwmn ^>eech paitkHpams te very high, one woukJ expe« more 
vioiatlons of QUEST and perh^ more humorous or sarcastk; replies. 

The above oonsiderattor^ ftistrate some potential Hmts of QUEST but do not lirply that the 
model is useless. QUESTwouWbequlteusefultf ft couW account for 80% of the answers in 
naturalists conversions, with the other 20% being explaM by components at the level of 
conversational meanmg. 

We analyzed the repfies that indivMuals gave to why, how. when, and CONS questions hi the 
three pragmatteaBy complex environmei^s. We classified each statemerA node &i a reply into one 
of 1 7 answer categories. Two trained Ridges reliably categorized the ^atement nodes {with 
reltability scores of .85 or higher among the three stuctes). N&ie categories involved primary 
topic infonnation" whereas 8 camgorles Involved "pra^natic" irtormation. The primary topte 
infomiation consisted of the events and a«ivSies referenced by the question (I.e.. the event of 
the Taanic silking, the procedure of obtaining a credit card). In oor^rast. the pn^natic &ifomiatk>n 
addresses (a) the answerers attitude and reaakjns to the |»ftnary &ton« 
social, communicative interacnkm bmween the questioner and ai^nverer. 

« the answer was priniafy topte Wwmalion, It was a«5i(^ to one of the following 
causal antecedent, causal consequence, causal antecedent of causal consequence, 
superordinate goaVactten. st^n^itte goal/action, consec^nce of subord^e actton. time 
index, tocatton Index, and style specBteatlon, For any given question category, only a subset of 
these categories constitutes legal answers. The legal arwens to the questions are specified in 
Table?. C^38%of theceBsfaiTable7areiegal. 

We adopted l^Andrade and Wish's (1985) taxonomy of speech aas m ojr analysis of the 
pragmatteinfonnatioa Their ciassffication scheme ncAoi^ has a soQdtheorettealfound^n in 
speech act theory, but also can be used reHabiy by tr^ied judges. D'Andrade and Wish's scheme 
has seven categortos: Assertion, quemion, reque^drective. reacskm, expressive evahiatton, 
commitment, and (todaration. We included one additional pr^matic category, called "support 
infonnatkMi." whk^ starts or motKrates the speech acts in the above pr^jmatic c^egorfes. It 
should be noted that a statement node was assigned to one of these f^agmatte categories only if 
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It did not refer to the primary topic ir^fmation. Afi of the answets in the pragmatic categoric «i«re 
considered violaliofw of the QUEST model. Therefore, only 18% of the 17 categories would be 
accomod^by the am seaich procedures of QUEST. 

TelepnoM SMYCy atuttr. Experimer^rs telephoned dtizer» in the Men^jhis community, 
introduced themselves ae researchers m Memphte »^ Ur^versity. and asked them one 
question abot4 a Nstorical event or acSon. The question c^gories were why, how, when, and 
com (lue^ions. Tfwetoie. the fotowing fair questions would be generated from the svem 
IheTitaricsafTK: why cW the Titanic sink?, how did the Titanic sink?. VVhen did the TSan^ 
and What ware the consequences of the Titanic storing? 

Twelve everts and &nemiona!aak)n8 were queried wtm these tour types of (^esttoTO^ The 
Titanic sank, the ^sace shuttte Chaiienger blew up. Governor tXikakis the 1988 fmkiential 
election, the US infimton cbopped, HinWey atten^jted to assassinate PresWent Reagan, whale- 
hunters jcNned bi an effort to lesGus the Alaskan whales. Gorbachev insmuted an open d^ 
poUcy in Rus^a, Iran^ Ayatoiah rehN»ed the US emb^ hostages, the US puSed out of 
Vietnam, the Olympk: games w«re estabKshed. Nbam goi bivoived wfth Watmg^e. and Bush 
chose Dan Qus^le as his presktenttelnjnryngm^. Given ttiat mere were 12 queried 
events/actions and four questton categories, there virere 48 unk^e questions altogether. 

The questtoners vyere 7 researdi ass^tants at MSU. The questk>r^ {i«roduced themselves and 
provided some ba(*ground Inlonnatton before they asked me (^estton. The questioners ^atsd 
that they were researchers at MSU. that they were condwabig a brW suwey on current events 
and historical evei^, and that they had only one questton to ask. The que^ners ^ed 
whether the answerer wouU be wlWng to complete the sunwy. If the answerer compBed, men the 
questtoner asked the ^^lestion and the answer was taperecorded. Each questtoner coltocted 
one observation tor each of tfw 46 questions. The answerers were ^citizens in the Menphis 
area who answered the tetof^ione and suppBed cooperative answ^. Answers were deleted and 
replaced » the person hung up the telephone or answered "I dom know." 

The arswers were segre^ted into st^ment units aocordmg to Gra^ser and Claik's (1965) 
representattonal systom. Each statement node was then assigned to one of the 17 answer 
categories ^aectfled ^x>ve. The number of siaement units prodi«»d by why. horn, when, and 
CONS quesftons was 119, 141, 90, and 134. respectively. The total rwmberwas 484, or 1.4 
statement node per reply. 

When consWering aS four q^ton categories. 94% of the aiswers refened to iwlmary topic 
informatton whereas 6% involved pratpnaticinfonn^on. waWn me categories of prttrirytopfc 
informatton, veiy few of the answers vk>iated QUESTS arc search procedlires. The percenter 
of violatkMTS (withhi primary topk: intonnallon) were onfy 5%, 3%, 2%, and 7% for why, how. when, 
and CONS questions, respective^. Ncme of these peicentages sigr^ftoamiy (Mersd fiom 0. The 
answers ki the pragmMic c^gories are oonsUered QUEST vtoiattons bi adcffikm to the vk^tons 
wHhin prtoiaiy topto Intomii^. When an vtoiattons are oonskierBd, the pert^rttages of vtoiattons 
were 1 <%, 8%, 8%, and 15% for why, how, when, and CONS questtons. In summary, only 1 0% 
of the inswers vtolated QUESTS arc seartm pmcectores: or^ 4% of the answers rsferr^ to 
primary topic toformiOksn were QUEST vtolattoi«. 

The telephone sunmy context has a number of distinctive pr^matto assumpttortt thirt muti be 
considered. Ttw (^stions are not gerujine toformatton seeking questions ki \tm sense that the 
questtoner is seeking information from a topto expert The questionem ate not askb^} questions 
In order to fffi gaps in tiieir knowledge base about the Titai^ instead the questioners are seek^ 
informatton about the belies. attSudes, and expectirttons of the general publto. They are testing 
the answerers about their knowtodge of historical evertt, in a similar fashton that teachers quiz 
students. That Is, the relevant |»agmatto mode is "make me know that you know" ratiierthan the 
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pragmatic mode erf a genuine Irtoimation seekir^ question {l.e.. 'make me know what I donl 



know"). 

BusinefflitfanMfifiOT . C^iege^Mlents pretended they wen cu^>mers and asked cteffcs 
^n^^^!!^.S^iS^^!;^^^^^' * <^«toner walked inft) a bank and ffidod a tefler 
rtQff gpfls a pwaon OW n am fiartf?. The teilei^ answer w^tape-reoorded and teter transcribed 
Conpared to the telephone sufvey, ttie business cmext provtted a ri»^ 
foundmton tor genukie Wwmatton seekbig questtons. The questioner went to an expert on the 
topic under the ^iise of neecSng intonnatton. w 

The questk>nerB were 24 students fai a research methocfe ccmrse at Mentis State University 
The answerers were 144 derks and other personrwl ^ focal businesses in the l^emphfe 
community. Each quesitoner coltected 6 CVA protocols. Altogether, there were 9 ffljestions 
each of which was asked in a partkajlar setting with ai^3n>priate props. The questior^ are listed 
beky*!, with settings in parsnmeses. 

Why are people getttoig compact dtek players? (stereo store) 
Why wouU a person buy expensive sneakers? (shoe store) 
Whywo(ildapeiBontia(evitan«iE? (pharmacy) 
How does a penson get thte ciB(» card? (bank) 
How would a c»«d j^y «rtth this toy? (toy stcHe) 
How do you oook rice? (supermarket) 

What would happen if i put tWs c^e in the freezer? (bakery) 

What would h^spen If I wore another person's glasses tor a week? (eye doctors off we) 

What wouW hw©n if I wore this at a wedding? (dt^ng stor«) 

We man^iated the context that preceded the wslion. In half the obsen^attons. there was no 
context except for the expression "Excusfs me." In the oflier half of the otoervMk>ns, the 
questton wm preceded by orw m- two context sentences thm clarified tf» questionei^ motives 
for asking the questton. However, the dma analyses shov^ no cMferences between the two 
context contfitions so the dma are collapsed in this report 

The OA protocols were analyzed hi the same way as bi the telephone »iiveyMi«Jy. Themean 
nunr*er of st^ement nodes per answer was 5. 1 . 4.3. and 3.7 tor the why, how. and COf^ 
questions, respectiveiy. The vad majortty of the 634 statement nodes were from primary topic 
information (78%) rmher flian pragmatto intomiation (22%). The percentages ar^rs in 
pragmatfc categories varied among que^n categories, with means of 23%. 12%, and 34% for 
why. how. and COf« questions, respectively. I^earfya0(^ the answers with^ the primary topic 
information were oons^tent with QUEST. The percerttages of QUEST vtolattons were 3%. 0% 
and 8% for why, how. and CONS cpjestior^. 

Filmed intgfYifliaa. Sonie of the Wmed interviews were of expeits on topics in science (i.e., 
Conversattons w«h DavW Myers, the Brm Series, and the Hkjman Animal Series). The other 
interviews were of popular or contrcyersiai hxflvkAjals on teieviston pruj^ms (I.e., the Phil 
Donohue Show, Nfgt«lnewah Tea Koppel, and the Mcl^fi/Uhrerl^ews Hour). Tlwfacathat 
these intefviews aie f«med (KkS» m fcnportar^ level of pragmatic complexfty. In particular, when a 
person answers a questton, there are two ciAsses of iisteners ttiat must be t^en Mo 
consideratten. First, thwe is the intewtower, the person who asks the questton. Second, there is 
the audience who views me fUm. in thto sense, two diatogues are actuaRy being heto. Thisadded 
complexity would presumably increase the nun^ of goals and cor«trali«8 in tfw goal stnK^ure 
of the person being irderviewed. Given the added complexay In these media events, one might 
expect the QUEST model to be challen^ more severely. 
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We an^ed the quostions and ansiifm in 12 hours the above fKnedi^^ Atotaiof436 
questions wefB receded during the bitefvievvs. There were 28 why t^msttons and 41 how 
questtonSiWheraas^ when and CC^Squetiionswera very tow in frequency. Therefore, we 
restricted oiH-M of QUEST to the why and how questions. The overalnuntMif of statement 
nodes in the answers was 94 for why questions and 143 for how questtons. Themeannumt)erof 
nodes per question was 3.6 and 3.3 for and how qiMi^ions. re^MM^veiy. 

The answers were analyzed in the same w^ as in t)w telephone suf^ and bueii^ traisffi^tion 
studies. The vast majofty d amweis cx>fsisted of prbnary information [7S%) rather than 
pragmatic infonnmicn(^%). Analyses of the jarimary to):ric infonnation revealed that the 
percer^ages of QUEST vtoiatlons were 17% and 4% tor why mi how questions, respectively. 
When considering aR answers, 68% were consistent with QUEST. 

Sumnmry of three studies. The three ^ies rotjustly supported QUESTS arc seart^ pnjcedures 
when we consider tNprtarary topic mfoFmatlon, Nearly ail of these answers (95% aoossttie three 
studies) were le(^ answers. fSos^ of QUESTS violaMons consisted of p r ^ gpi atte categories 
outside of the scope of ^enxxfol. TI«mo^fref^er9praipnatlccato(^»tesinourG^Apn»occ^ 
were assertions (which were not part of the prin^ty topic Mt^mation), ooumer-que^iDns, 
requests, cfirectlves, and exf^essfve evaiuatiot^. The proportion of atmm that occurred in the 
pragmatic categories kicreesed as a ftmction a the prsomtiic complexly the oonversationai 
imer^ion. Aoconfingtoouranaiysisof the goals and pn^)mitticoiHistfalrit8,theie was 
toitowing order of the three contends on pragmatic comiNexKy: survey < business transaction < 
interview. The corresponding percertf ages of answers in the pr^matic c^egories were 6%, 
22%. and 25% 

There are a nun^f^mechanisnis that could ex|^ the QUEST violaiions. Onenwcfmnism 
focuses on the planning and agenda of ^>eech participants. A r^)}y might not answer the 
imniediatet^mstton. bid &i^ead«Sdr^ the iri^jBctt goals of the questioner and ari^^ A 
second mechanism oonsisis of Iranstormlng the Bteral question into a differerrt question that 
seems more i^)f>rDpriate bi the context. A third mechanism irTvoIvss tlw common ^ound 
between (^j^ioner and answerer. When the common ground is extremely high and new 
infonnmion is (Mfloull to oome by, the replies migM be huRKHot^, sarcastic or "off the wal" 
When the common ground i^;>pro«hes zero, one ntight have tiou^ fommi^^ any answer 
{e.g., a stranger approacMfig you on the ^reei and asltbig "Why are people gettbig compact disk 
players?*). A fourth rrachanism addresses whether the que^n is a gemJbie intormatton 
seei(ing question (see van der Meij, 1987). There should be more QUEST violations to the 
extent that a query deviates from being a genuine information seeiOng cpestioa 
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Table 1 

Composition Rules for Six Categories of Arcs 

A = source node 
B = end node 
CONSEQUENCE(Q A causes or enables B 

A precedes B in time 
{event 1 state I style} -C-> {event 1 state I style) 
IMPLIES dm) A implies B 

A and B overlap in time 
{event I slate I -^vle) "Im-> [event i state I style) 
REASON (R) B is a reason or motive for A 

B is a superordinatc goal of A 
(goal) -R-> (goal) 

MANNER (M) B specifies the manner of accomplishing A 

A and B overlap in time if the goals are achieved 
(goal) "M--> (goal ! style) 
(style) -M-> (style) 
OUTCOME (0) B specifies whether or not the goal in A is 

accomplished 

(goal) -0"> {event I stale 1 style) 
INITIATE (I) A initiates or triggers the goal in B 

{event I stale I style) -I-> (goal) 
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Table 2 

Beta-weights from Multiple Regression Analyses on Answer 
Production Scores (Expository Text) 



Pr«iic!or VariablQ 



Question Catfignry 



Arc search procedurs 
Structuial distance 
Causal strength 

To|^ famiSarity 
Argument overlap 
Knowledge domain 

Physical 

Biological 

Technological 

VariarK^e explained in 
item analysis (R^) 



tUfiiS: * significant at < .05 in item analysis 





Whv 








vvnun 




Physical 


Biological 


Technological 


.44' 


.27* 


.30* 


.37* 


.41* 


.49* 


.57* 


-.22* 


-.16* 


-.12* 


-.23* 


-.24* 


-.39* 


-.15* 


.07 


.00 


.02 


-.02 


.04 


.12* 


.04 


-.23* 


.02 


-.19* 


-.04 


1 

-.04 


-.10* 


-.11* 


-.03 


-.02 


.16* 


.05 


.01 


.04 


.09* 








.02 


.03 


.11* 


.16' 








.10* 


.07 


.09* 


.17* 


.30 


.10 


.15 


.21 


.23 


.42 


.40 
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Tables 

DlSnttndTs'(Eliglit^^^^^^^ °" Goodness-of-Answer Judgements and 



Question Group 



Arc 
Search 



L«i»WiiI;iit 



Structural 
Distance 



Causa) 
Strength 



WHY (no connective) 

Physical 

Biological 

Technological 
WHY (because connective) 

Physical 

Biological 

Technological 
WHY (In order to connective) 

Physical 

Biological 

Technological 

HOW 
ENABLE 

WHEN (no connective) 
WHEN (before connective) 
WHEN (after connective) 

CONSequence 

* significant at J2 < .05 in item analysis 



Arc 
Search 



Decision Latfin^i^ff 



Structural 
Distance 



Causal 
Strength 



.63* 
.32' 
.17* 


-.35* 
-.39* 
-.52* 


.07 
.00 
.00 


.29* 

.12 

.18* 


-.13 
-.06 
-.21* 


.05 
-.17 
- 13 


64* 
.42* 
.33* 


- 4fi* 
.37' 
-.53* 


.U 1 

-.01 

-.11 


OA* 

.JO 

.11 

.29* 


-.18 

-.15* 

-.18* 


-.11 
.01 
.00 


.45* 
.69* 
.69* 


-.39* 
-.19' 
-.31* 


.16* 

.00 

.16* 


.03 
-.10 
.11 


-00 
-.13 
.05 


-.03 
.00 
.11 


.57* 


.39* 


-.04 


.33* 


-.20' 


-.03 


.71* 


-.29* 


.00 


.25* 


-.10* 


-.16* 


.58* 
.92* 
.84* 


-.43* 

-.02 

-.28* 


.02 
.00 
.04 


.17* 
-.05 
.22* 


-.08 
-.29* 
.12* 


.00 
.01 
.06 


.57* 


-.42* 


.06 


.05 


-.08 


-.04 



Qucslion Answering 39 



Table 4 

Beta Weights of Pn^Hi ctors of noodness-of-An^w*>r , 
Segregated bv Questio n Category (Narrative Text) 



Predictor Variable 

Convergence mechanisp^ 
Arc Search Procedure 
Structural Distance 
Constraint Satisfaction 

Information Source?^ 
Generic Information 

Sources 
Verbatim/Inference 

Answer 

Auxiliary Variables 
Answer Production Score 
Queried Action/Event 
Story 

Varariance explained by 
multiple regression 
equation (R^) 



i significant at p < .05 in item analysis 
s significant at p < .05 in subject analysis 



GOA Decision 



GOA Rating 



m\Y HOW WHEN ENABLE CXm 

.49is -45is .33is .51is A7i, 

.16is -.13is .06s 

•I6is .31is .27is 



.13is 
.19is 



.03 
.24is 



WHY HOW WHEN ENABLE CONS 

.49is .45is .32is .48is .41 

".I4is -J5is -.09is 

.16is .30is .35is 



IS 



--15is 
.I5is 



.06 



.Mis 


.02 


-.03s 


.06s 


-.OJ 


.03s 


.06s 


-,05s 


.05s 


-.02 


.01 


.lOis 




.03s 


.04s 


.01 


-08is 


.16is 


.02 


.02 


.29is 
-.07is 
-.08is 


.16is 
-.02 
.02 


•29is 
.I6is 
.05 


.02 
-.07s 


.32is 
-.05s 
-.03 


.36is 
-lOis 
-.06s 


.20is 
--03s 
.04 


•26is 
.09is 
.09is 


.26is 

.02 

.03 


•35is 
-.01 

.16is 


.57 


.52 


.41 


.55 


.44 


.61 


.57 


.45 


.55 


.43 
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Table 5 Beia-wcighls of Prediclors of GOA Decision Latencies (Narrative Text) 

All Q/A Items 



Predictor Variables 

Convergence Mechanism 

Arc search procedure 
Structural distance 
Constraint satisfaction 



Q/A Items with Zero 
„ Answer Production Score 

Why How When Enable CONS Why How When Enable CONS 

.16is .09is .02 
■.05s -.07s - 04 
.lOis .01 .01 



Information Snurr**^ 
Generic information sources .03 
Verbatim/inference -.02 
answer 



.11 i .05 
-J4is -.06 



Auxiliary Variables 
Answer production 
score 

Queried action/event 
Story 

Content words 

Syllables 

Word frequency 

Variance explained by 
multiple regression 
equation (R2) 



-.27is -.22i -.17is 



-.07 
.07 

,24is 
.02 

.42 



-.03 
.25i 

.00 



.03 

.14i 

.26i, 

.20is 
.05 



.38 .29 



10 




OA- 






. J 2 


% MX. 

-lOs 


- 07 


OS 


- ns 






-.06 


.05 


- 02 


01 


OK 






-.04 


-.05 


.00 


.01 


.12. 


.07 




.03 


-.03 


-.09; 


- 04 


- ()'> 


- Id 






f\ A 

-.04s 


.16u 


-.25i, 












.05s 


.04 


-.06 


.03 


.04 


.04, 


.03 


.16i 




.05 


.18 


.19 


.10 


.14 


.17is 


.165s 


•36is 


.20u 


.26is 


.18,, 


•17,, 


.29is 


.30i, 


• 24is 


.35is 


.21,s 


.28i, 


.30i, 


.06 


.00 


.02 


.02 


.04 


-.07 


.00 


.26 


.38 


.43 


.38 


.30 


.27 


.38 



i significant at ^ < .05 in iicni analysis 
s significant ai i> < .05 in subject analysis 
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Table 6 

Beta weights of Predictors of Gorxiness-of-Aiiswcr Judgements (Generic Concepts) 



Predictor Variable 

Convergence Mechanism 
Arc Search Procedure 
Stiuctural Distance 
Constraint Satisfaction 

Information Sources 
Verb GKS 
Noun GKS 

Auxiliary Variably 
Answer Production Score 



WHY 
.43* 
-.13* 

*i2f H 



.10* 
.16* 



Queried Action 



HOW 
.42* 
-.12* 
.23* 



.10* 

.05 



ENABLE GCHVS 
.19* .37* 



.13 



.16' 



.01 

.27* 



.15* 
.03 



.20* 



-.14* 
.25* 



.05 
.07 



.15= 



WHY 

.36* 
-.14* 
.13* 



.12* 
.04 



Queried Event 



.09 = 



HOW 
.15* 

-.10 
.04 



.26* 
.13* 



18* 



fiNABlJ: CONS 

.31* .27 

-.10 -.26* 

.04 .11 



.17* 
,12* 



.06 
.07 



10 



Variance explained by 
multiple regression 
equation (R^) 



.42 



.46 



.31 



,34 



.35 



.30 



.36 



.25 



* significant at ^ < .05 in item analyses 
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Table 7 

Answer Categories Predicted bv thf> o gEST Model. 

ANSWER CATEGORY CXJESTJON CATEGORY 

Queried Events WHY K}N CONS V\IHBSJ 

Causal Antecedent X X 

X 

Causa! Consequent X 
Causal Antecedent of a 

Causal Consequent X 
Time Index X X 

Location Index X 
Style Specification X 

Queried Actions 

Causal Antecedent X X 

X 

Causal Consequent X 
Causal Antecedent of a 

Causal Consequent X 
Time Index X X 

Location Index X 
Style Specification X 
Superordinate Goal/Action X XX 
Subordinate Goal/Action x 
Consequent of Subordinate 

Action X 



The X signifies that the ar-wer category is a legal answer according to 
the QUEST model. 
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Figure 1 
Components of QUEST: 
A Model of Human Question 

Answering 

Interpreting the question 

Parsing the question into a logical form 

Identifying the appropriate question category 

Information Sources 

Episodic knowledge structures (text experience) 
Generic knowledge structures 
(concepts, scripts, frames, etc.) 

Knowledge is represented as conceptual p;raph structures 

Convergence 

Interg^Ctipp of nodes from different information sources (plus 
structural distance) 
Arc search prorpHtir^g 
Constraint satisfaction 



Pragmatics 

Qoals. of questioner and answerer 
Common proun^ 



Figure 2 
An Example Causal Network 
on Nuclear Power 
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State 1 

Tank surrounds 
atom splitter 




State 2 
Water is in tanic 



State 3 

Generator is in nuclear 
power plant 




ERIC 
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Figure 3 

Information Sources for a How-Question 




Q/A 

Procedure 



how <event> 
question 



Information Sources 
Specific General 





HEAT 
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Figure 4 
An Example Goal Hierarchy 
with Goal Initiators 



7 Event 
Bill became angty 
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A Taxonomic Structure 
and a 
Spatial Partonomy 
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Figure 6 

Arc Search Procedures for Events and Actions 

Causal Antecedents 



EVENTS 




Why? 
How? 

What enabled? 
When? 

Causal Consequences 




What are the consequences? 



ACTIONS Superordinate Goals 

Why? 
CONS? 



R 



R 



Subordinate Goals/Actions 

How? 

When? 

Enable? 
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Figure 7 
A Goal Structure 
Running Parallel 
with a 
Causal Chain 




Tlie turbines produce 
electricity. 



Steam drives a series of 
turbines. 



Tlie water in tlie 
surrounding tank is 
iieated. 



Heat energy is released 



Atoms are split into 
particles. 



'J I 



Figure 8 
Question-Answer Matrices 
with Answer Production Scores 
for Scientific Event Chains 
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WHY (overall) 

Answer position 



WHY (physical) 

Answer position 



WHY (biological) 

Answer position 



WHY (technological) 

Answer position 





1 


2 


3 


4 


5 




1 


2 


3 


4 


5 




1 


2 


3 


4 


5 




1 


2 


3 


4 


5 


1 




.14 


.10 


.05 


.21 


1 




.13 


.11 


.05 


.08 


1 




.09 


.09 


.08 


.30 


1 




.20 


.11 


.03 


.27 


2 


.17 




.23 


.15 


.12 


2 


.34 




.17 


.05 


.11 


2 


.08 




.19 


,14 


.1 1 


2 


.08 




.33 


.25 


.14 


3 


.19 


.26 




.21 


.15 


3 


.36 


.47 




.08 


.09 


3 


.08 


.16 




.31 


.22 


3 


.13 


.14 




.23 


.14 


4 


.08 


.22 


.13 




,30 


4 


.17 


.41 


.25 




.13 


4 


.03 


.17 


.08 




.47 


4 


.05 


.09 


.06 






5 


.08 


.13 


-11 


.23 




5 


.14 


.25 


.19 


.38 




5 


.04 


.06 


.11 


.19 




5 


.05 


.06 


.05 


.14 





1 

2 
% 

m 

4 
5 



HOW 

Answer position 



ENABLE 

Answer position 



1 


2 


3 


4 


5 




1 




3 


4 


5 




.13 


.05 


.06 


.06 


1 




.11 


.07 


.05 


.06 


.30 




.08 


.04 


.01 


2 


.36 




.11 


.11 


.03 


.22 


.31 




.08 


.03 


3 


.26 


.40 




.12 


.04 


.09 


.19 


.25 




.04 


4 


.15 


.26 


.31 




.06 


.10 


.11 


.15 


.34 




5 


.10 


.21 


.18 


.34 





1 

2 
3 
4 
5 



WHEN 

Answer position 
1 2 3 4 5 



- .19 

.62 - 

.39 .56 

.19 .35 

.13 .16 



.09 
.16 

.46 
.25 



.04 .06 
.07 .04 
.19 .05 

— .17 

.45 - 



1 
2 
3 
4 

5 



CONS 

Answer position 



1 


2 


3 


4 


5 




.35 


.30 


.21 


.30 


.06 




.44 


.24 


.32 


.04 


.13 




• 2 


.40 


.04 


.09 


.15 




.56 


.08 


.04 


.06 


.08 
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Three-way Interaction among Arc Search, Structural Distance, 
and Constraint Satisfaction (Narrative Text) 

, WHY, HOW, ENABLE WHEN, CONS 




2 3 12 3 

Structural Distance Structural Distance 



LEGEND 






Legal answers that satisfy constraints 




—# 


Legal answers that don't satisfy constraints 






niegal answers that satisfy constraints 


o- 


— o 


Illegal answers that don't satisfy constraints 


0- 
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Two-way Interaction Between Arc Search and Constraint 
Satisfaction for Decision Latencies (Narrative Text) 
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